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HOKMJ KNDOKKTRIAIi SPECIFIC 
STEROID -BINDING FACTOR I, II and III 

This invention relates, in part, to newly identified 
polynucleotides and polypeptides; variants and derivatives of the 
polynucleotides and polypeptides; processes for making the 
polynucleotides and the polypeptides, and their variants and 
derivatives; agonists and antagonists of the polypeptides; and uses 
of the polynucleotides, polypeptides, variants, derivatives, 
agonists and antagonists. In particular, in these and in other 
regards, the invention relates to polynucleotides and polypeptides 
of human endometrial specific steroid-binding factor I, II and III, 
sometimes hereinafter referred to as "hBSF I, II and III". 

BACKGROUND OF THE INVENTION 

The regulation of cells and tissues is controlled by autocrine 
and paracrine factors, such as systemic hormones and factors that 
modulate or mediate the action of hormones. 

Many peptides, expressed locally, can influence certain 
biological activity in the mammalian system and are very important 
in the regulation of cells of the epithelium. These factors 
largely have not been identified or characterized, particularly not 
in humans. 

A few factors that play a role in the regulation of functions 
of the lung and uterus, both adult and fetal, have been identified 
in non-human organisms. One such factor is mammalian CC10, i.e., 
human, rat and rabbit CC10. (Wolf, M. et al., miman Molecular. 
Genetics . l(6):37l-378 (1992)). Clara Cell 10 kDa secretory 
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protein tCCIO) which is also called Clara Cell 17 kDa protein, is 
a hotnodimer consisting of 8.5 kDa monomers that are joined by two 
disulfide bonds (Umland, T.C. et al., HP?,, BAqJ,,, 224:441-448 
(1992)). It is the predominant secreted protein of lung Clara 
cells which are the lining of the bronchiolar epithelium (Singh, 
G. and Katyal, S.L., -1- Histochem CVtochem. . 32:49-54 (1984)). 
The physiological role of the protein is not yet completely 
understood. It has been reported that CC10 specifically binds 
raethylsulfonyl-polychlorated biphenyls (PCBs) (Nordlund, Moler, L. 
et ml., -t m»l. Chem. . 265:12690-12693 (1990)) and inhibits 
phospholipase A, (Singh, G. et al., giPChem, 3iPPhYg , Acta , 
1039:348-355 (1990)). In the last few years the sequences of rat 
(Katyal, S.L. et al.. P™» ggmir. Res.. 25:29-35 (1990); and 
Hagen, G. et ml.. Witt?**" * eids Res - 18:2939-2946 (1990)), and 
human (Singh, G. et al., ftipqlMffl, PiftPhYg, Agta. 950:329-337 (1988) 
CdO cDNAs have been reported. cDNAs, and the derived amino acid 
sequences, show striking homologies to rat uteroglobin (Singh, G. 
etal., i ft<mfw. Acta. 1039:348-355 (1990); and Hagen, G. 

ec al., r*^*ir Acids Res.. 18:2939-2946 (1990)). 

Like CC10, rat uteroglobin is a covalently bound homodimer 
whose three dimensional structure is well known (Morize, I. et al., 
. 7 Moi. Biol. . 194:725-739 (1987). Uteroglobin egression in 
rabbits has been originally reported in the uterus during the 
preimplantation phase (Beier. H.M., B^chem . , BjQphyg. Actft . 
160-289-290 (1968) ) . More recently, the protein was also detected 
in oviduct (Kirchner. C. cm T*MW Rgf.. 170:490-492 (1976)), 
male genital organs (Beier. H.M. et al.. Cell TlSffue 165:1-11 
(1975)), esophagus (Noske. I.G. and Peigelson, M., ftlftl, Reprpfl, , 
15:704-713 (1976)) and lung (Noske, supra; and Torkkeli, T. etal., 
pi»nhv B . Acta. 544:578-592 (1978)). 
in vitro, several distinct properties of uteroglobin have been 
described. Soon after its discovery it could be shown that the 
steroid hormone progesterone is specifically bound by the protein 
(Beato, M. and Baier. R. » ftlTKhfim, Bigphyg. ft«a ,. 392:346-356 
(1975); and Beato, M. et al., J, St^riod BlQChffl.. 8:725-730 
(1977)). Therefore, rabbit uteroglobin was believed to be a 
potential carrier or scavenger of progesterone that regulates the 
progesterone concentration in the endometrium (Atger, M. et ml.. 
■t c^HBioehm. . 13:1157-1162 (1980))- It has also been shown 
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to specifically bind certain methylsulf onyl metabolites of 
polychlorinated biphenyls with even higher affinity than 
progesterone (Gillner, M. et al., J. Steroid Biochem. . 31:27-33 
(1988)). Furthermore, uteroglobin has been found to inhibit 
phospholipase A*. The relationships of all these properties and 
their physiological significance is still not understood and 
remains largely a matter of speculation. 

The rat CC10 mRNA is expressed like rat uteroglobin not only 
in lung but also in the esophagus as veil in uteri of estrogen and 
progesterone treated female rats (Hagen, G. 1990 supra) suggesting 
that rat CC10 is the rat counterpart of rat uteroglobin (see in 
general Wolf, M. et al., Human Mole cular Genetics . l(6):37l-378 
(1992) > . 

Human CC10 expression is abundant in non-neoplastic human 
lung, and it is detectable in tumors in corresponding cell lines 
at markedly lower levels (Broers, J.L.V. et al., Lab. Invest. . 
66:337-346 (1992); Linnoila, R.I. etal., Amer. J. Clin. Pathol.. 
90:1-12 (1988)). CC10 levels were also significantly lower in 
serum and bronchoalveolar lavage specimens obtained from smokers 
and lung cancer patients compared with specimens from healthy non- 
smokers (Bernard, A. et al., Buroo. Reap. J. . 5:1231-1238 (1992)). 

These findings suggest the expression of CC10 mRNA becomes 
altered in distinct lung compartments and may implicate a role for 
CC10 in the development of pulmonary carcinomas (Jensen, S.M. et 
al.. Int. J. Cancer. 58:629-637 (1994). 

Some of the biological properties of UG, such as masking the 
antigenicity of blastomers (Mukherjee, A.B., et al., Med. 
Hypotheses, 6:1043-1055 (1980)) and epididymal spermatozoa 
(Mukherjee, D.C., ec a J . , Science (Wash. D.C.) . 219:989-991 
(1983)), inhibition of monocyte and neutrophil chemotaxis and 
phagocytosis in vitro (Schiffman, E.V., et al., Agents Actions 
SUPPl , , 12:106-120 (1983)), and inhibition of ADP- and thrombin - 
induced (but not of arachidonic acid- induced) platelet aggregation 
(Manjunath, R. , et al., Biochem. Pharmacol . . 36:741-746 (1987)), 
may be due, at least in part, to the potent inhibitory effect of 
this protein on PLA ; activity (Levin, S.W., et al., Life Sci . . 
38:1813-1819 (1986)). A nonapeptide derived from the amino acid 
sequence of a-helix-3 of UG monomer (residues 39-47) possesses all 
the biological properties of the intact protein and has been 
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identified as an active site of TO responsible for its PLA 5 - 
inhibitory and antiinf lammatory activities tMiele. L. , et al., 
Ma r„r g (Lond.) , 335:726-730 (1988)). 

It has been indicated that cclOkD-specif ic transcripts are 
present in several nonrespiratory human organs and tissues. By 
using an antibody to rabbit TO. a OS-like immunoreactivity in human 
endometrium (Kikukawa. T.. et al.. ,T pm , Bn^ri^. Me^b, , 
67:31S-321 (1988)), prostate (ManyaJc, M.J.. et al.. J. 
140-176-182 (1988)). and respiratory tract (Dhanireddy, R.. at Ml.. 
B1r , h - ,« m hv. Pes. CO^mm,. 152:1447-1454 (1988,). has been 

described. , 

Recently, the cDHA (Singh. G-.. et al.. niochffl BlftPtlYB . 
950-329-337 (1988)) and the 5' regions (Wolf. M.. et al.. Hisasa 
tAni r^t. . 1:371-378 (1992)) of the gene encoding human 
uteroglobin (hTO) , a counterpart of rabbit TO (rTO) , has been 
characterized. Human TO or Clara cell 10-kD protein has si. 5% amino 
acid sequence identity with rTO (Singh. O.. ec al.. SiasbfiSU 
a<m(lv ,. Acta . 950:329-337 (1988)), S4.2% similarity with rat TO 
SSS^fl;^.. linn™ B10nhV B . K*. 1039 = 348-355 (1990,). 
and 52 8% with mouse TO (Singh. G. . et al.. BfP Tfunq Res, . 19:67- 
75 (1993, , . Although this protein was originally discovered in the 
alveolar Clara cells (Singh, G. , et al.. ,T nlntochem. , 36:73-80 
(1988,) it is detectable in many extrapulmonary tissues similar to 
the ones in which rTO is expressed (Peri, A., et al., WA Cell 
Biol 5-495-S03 (1994), and this expression is induced by 
progesterone, it appear that some of the biological properties of 
hTO are virtually identical to rTO (Mantile, G. , et al., L-Slfii. 
Chem. . 27:20343-20351 (1993,). 

It has been reported that TO in the rabbit uterine fluid is 
first detectable on day 3 of pregnancy, and peak level is reached 
on day 5 (for a review see Miele, L. . et al.. Rnri^r, M*. ■ 8:47 *- 
490 (1987)) . TO, by inhibiting PLA a activity, may down-regulate the 
production of proinflammatory lipid mediators, which promote 
contraction and motility of the uterine smooth muscle. Therefore, 
it is suggested that TO facilitates the maintenance of myometriai 
quiescence during gestation. 

There is a clear need in the art to further isolate and 
characterize proteins which are hooologues of mammalian Clara cell 
10 kDa secretory protein and rat prostatic steroid-binding protein. 
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The genes and gene products of the present invention display 
homology to the rat prostatic steroid-binding protein and Clara 
cell 10 JcDa secretory protein. 

SUMMARY OF THE INVENTION 

Toward these ends, and others, it is an object of the present 
invention to provide polypeptides, inter alia, that have been 
identified as novel hESF I, II and III by homology between the 
amino acid sequence set out in Figures 1, 2 and 3 (SBQ ID NO: 2, 4 
and 6) and known amino acid sequences of other proteins such as rat 
prostatic steroid -binding protein. 

It is a further object of the invention, moreover, to provide 
polynucleotides that encode hESF I, II and III, particularly 
polynucleotides that encode the polypeptides herein designated hESF 
I, II and III. 

In a particularly preferred embodiment of this aspect of the 
invention the polynucleotides comprise the regions encoding human 
hESF I, II and III in the sequence set out in Figures 1, 2 and 3 
(SEQ ID NO: 2, 4 and 6} . 

In accordance with this aspect of the present invention there 
is provided isolated nucleic acid molecules encoding mature 
polypeptides expressed by the human cDNA contained in ATCC Deposit 
No. 97401 <ESF I) , 97402 (BSF II) and 97403 (BSF III) . 

In accordance with this aspect of the invention there are 
provided isolated nucleic acid molecules encoding human hESF I, II 
and III, including mRNAe, cDNAs , genomic DNAs and, in further 
embodiments of this aspect of the invention, biologically, 
diagnostically, clinically or therapeutically useful variants, 
analogs or derivatives thereof, or fragments thereof, including 
fragments of the variants, analogs and derivatives. 

Among the particularly preferred embodiments of this aspect 
of the invention are naturally occurring allelic variants of human 
hESF I, II and III. 

It also is an object of the invention to provide hESF I, II 
and III polypeptides, particularly human hESF I, II and III 
polypeptides, that treat and/or prevent inflammation, asthma, 
rhinitis, cystic fibrosis, airway disease, neoplasia, atopy, 
inhibit phospholipase A ; activity, bind polychlorinated biphenyls, 
reduce foreign protein antigenicity, inhibit monocyte and 
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neutrophil chemotaxis and phagocytosis, inhibit platelet 
aggregation, regulate eicosanoid levels in the human uterus and 
control the growth of endometrial cells. 

In accordance with this aspect of the invention there are 
provided novel polypeptides of human origin referred to herein as 
hBSP I, II and III as well as biologically, diagnostically or 
therapeutically useful fragments, variants and derivatives thereof, 
variants and derivatives of the fragments, and analogs of the 
foregoing. 

Among the particularly preferred embodiments of this aspect 
of the invention are variants of human hESF I, II and III encoded 
by naturally occurring alleles of the human hBSP I, II and III 
genes . 

It is another object of the invention to provide a process for 
producing the aforementioned polypeptides, polypeptide fragments, 
variants and derivatives, fragments of the variants and 
derivatives, and analogs of the foregoing. in a preferred 
embodiment of this aspect of the invention there are provided 
methods for producing the aforementioned hBSP I, II and III 
polypeptides comprising culturing host cells having expressibly 
incorporated therein an exogenously- derived human hBSP I, II or III 
encoding polynucleotide under conditions for expression of human 
hBSP I, II and III in the host and then recovering the expressed 
polypeptides. 

in accordance with another object of the invention there are 
provided products, compositions, processes and methods that utilize 
the aforementioned polypeptides and polynucleotides for research, 
biological, clinical and therapeutic purposes, inter alia. 

In accordance with certain preferred embodiments of this 
aspect of the inventxon. there are provided products, compositions 
and methods, inter alia, for, among other things: assessing hBSP 
I, II and III expression in cells by determining hBSF I, II and III 
polypeptides or hBSF I, II and Ill-encoding mRNA; expressing hBSF 
I, II and III in vitro, ex vivo or in vivo by exposing cells to 
hBSF I, II and III polypeptides or polynucleotides as disclosed 
herein,' assaying genetic variation and aberrations, such as 
defects, in hESF I, II and III genes; and administering a hBSF I, 
II and III polypeptide or polynucleotide to an organism to augment 
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hBSF I, II and III function or remediate hESF I, II and III 
dysfunction. 

In accordance with certain preferred embodiments of this and 
other aspects of the invention there are provided probes that 
hybridize to human hBSF I , II and III sequences. 

In certain additional preferred embodiments of this aspect of 
the invention there are provided antibodies against hESF I, II and 
III polypeptides. In certain particularly preferred embodiments 
in this regard, the antibodies are highly selective for human hBSF 
I , II and III. 

In accordance with another aspect of the present invention, 
there are provided hBSF I, II and III agonists. Among preferred 
agonists are molecules that mimic hBSF I, II and III, that bind to 
hBSF I, II and III -binding molecules or receptor molecules, and 
that elicit or augment hBSF I, II and Ill-induced responses. Also 
among preferred agonists are molecules that interact with hBSF I, 

II and III polypeptides, or with other modulators of hBSF I, II and 

III activities, and thereby potentiate or augment an effect (s) of 
hBSF I, II and III. 

In accordance with yet another aspect of the present 
invention, there are provided hBSF I, II and III antagonists. 
Among preferred antagonists are those which mimic hBSF I, II and 
III so as to bind to hBSF I, II and III receptors or binding 
molecules but not elicit a hESF I, II and III -induced response or 
more than one hBSF I, II and I II -induced response or which prevent 
expression of hBSF I, II and III. Also among preferred antagonists 
are molecules that bind to or interact with hESF I, II and III so 
as to inhibit an effect (s) of hBSF I, II and III. 

The agonists and antagonists may be used to mimic, augment or 
inhibit the action of hBSF I, II and III polypeptides. They may 
be used, for instance, to treat and/or prevent an inherited 
susceptibility to asthma. 

In a further aspect of the invention there are provided 
compositions comprising a hESF I, II or III polynucleotide or a 
hESF I, II or III polypeptide for administration to cells in vitro, 
to cells ex vivo and to cells in vivo, or to a multicellular 
organism. In certain particularly preferred embodiments of this 
aspect of the invention, the compositions comprise a hESP I, II or 
III polynucleotide for expression of a hESF I, II or III 
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polypeptide in a host organism for treatment of disease. 
Particularly preferred in this regard is expression in a human 
patient for treatment of a dysfunction associated with aberrant 

endogenous activity. 

Other objects, features, advantages and aspects of the present 
invention will become apparent to those of skill from the following 
description. It should be understood, however, that the following 
description and the specific examples, while indicating preferred 
embodiments of the invention, are given by way of illustration 
only various changes and modifications within the spirit and 
scope of the disclosed invention will become readily apparent to 
those skilled in the art from reading the following description and 
from reading the other parts of the present disclosure. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings depict certain embodiments of the 
invention. They are illustrative only and do not limit the 
invention otherwise disclosed herein. 

Figure 1 shows the nucleotide and deduced amino acid sequence 

of human hBSP I. 

Figure 2 shows the nucleotide and deduced amino acid sequence 

of human hESF II. 

Figure 3 shows the nucleotide and deduced amino acid sequence 

of human hBSP III. 

Figure 4 shows the regions of similarity between amino acid 
sequences of hESP I and rat prostatic steroid-binding protein 
polypeptides (SEQ ID NO:25) . 

Figure 5 shows the regions of similarity between amino acid 
sequences of hBSP XI and rat prostatic steroid-binding protein 
polypeptides (SBO ID NO:26> . 

Figure 6 shows the regions of similarity between amino acid 
sequences of hESF III and rat prostatic steroid-binding protein 
polypeptides (SBO ID NO:27) . 

Figure 7 shows structural and functional features of hBSP I 
deduced by the indicated techniques, as a function of amino acid 
sequence . 
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Figure 8 shows structural and functional features of hESF II 
deduced by the indicated techniques, as a function of amino acid 
sequence . 

Figure 9 shows structural and functional features of hESF III 
deduced by the indicated techniques , as a function of amino acid 
sequence . 

GLOSSARY 

The following illustrative explanations are provided to 
facilitate understanding of certain terms used frequently herein, 
particularly in the examples. The explanations are provided as a 
convenience and are not limitative of the invention. 

DIGESTION of DMA refers to catalytic cleavage of the DNA with 
a restriction enzyme that acts only at certain sequences in the 
DNA. The various restriction enzymes referred to herein are 
coontercially available and their reaction conditions, cof actors and 
other requirements for use are known and routine to the skilled 
artisan. 

For analytical purposes, typically, 1 of plasmid or DNA 
fragment is digested with about 2 units of enzyme in about 20 pi 
of reaction buffer. For the purpose of isolating DNA fragments for 
plasmid construction, typically S to 50 ng of DNA are digested with 
20 to 250 units of enzyme in proportionately larger volumes. 

Appropriate buffers and substrate amounts for particular 
restriction enzymes are described in standard laboratory manuals, 
such as those referenced below, and they are specified by 
commercial suppliers. 

Incubation times of about 1 hour at 37 "C are ordinarily used, 
but conditions may vary in accordance with standard procedures, the 
supplier's instructions and the particulars of the reaction. After 
digestion, reactions may be analyzed, and fragments may be purified 
by electrophoresis through an agarose or polyacrylamide gel, using 
well known methods that are routine for those skilled in the art. 

GENETIC ELEMENT generally means a polynucleotide comprising 
a region that encodes a polypeptide or a region that regulates 
transcription or translation or other processes important to 
expression of the polypeptide in a host cell, or a polynucleotide 
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comprising both a region that encodes a polypeptide and a region 
operably linked thereto that regulates expression. 

Genetic elements may be comprised within a vector that 
replicates as an episomal element; that is, as a molecule 
physically independent of the host cell genome. They may be 
comprised within mini -chromosomes, such as those that arise during 
amplification of transfected DHA by methotrexate selection in 
eukaryotic cells. Genetic elements also may be comprised within 
a host cell genome; not in their natural state but, rather, 
following manipulation such as isolation, cloning and introduction 
into a host cell in the form of purified DHA or in a vector, among 
others . 

ISOLATBD means altered -by the hand of man- from its natural 
state; i.e., that, if it occurs in nature, it has been changed or 
removed from its original environment, or both. 

Por example, a naturally occurring polynucleotide or a 
polypeptide naturally present in a living animal in its natural 
state is not "isolated," but the same polynucleotide or polypeptide 
separated from the coexisting materials of its natural state is 
-isolated-, as the term is employed herein. For example, with 
respect to polynucleotides, the term isolated means that it is 
separated from the chromosome and cell in which it naturally 
occurs . 

as part of or following isolation, such polynucleotides can 
be joined to other polynucleotides, such as DMAs, for mutagenesis, 
to Conn fusion proteins, and for propagation or expression in a 
host for instance. The isolated polynucleotides, alone or joined 
to other polynucleotides such as vectors, can be introduced into 
host cells, in culture or in whole organisms. Introduced into host 
cells in culture or in whole organisms, such DNAs still would be 
isolated, as the term is used herein, because they would not be in 
their naturally occurring form or environment. Similarly, the 
polynucleotides and polypeptides may occur in a composition, such 
as a media formulations, solutions for introduction of 
polynucleotides or polypeptides, for example, into cells, 

compositions or solutions for chemical or enzymatic reactions, for 

instance, which are not naturally occurring compositions, and. 

therein remain isolated polynucleotides or polypeptides within the 

meaning of that term as it is employed herein. 
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LIGATION refers Co the process of forming phosphodiester bonds 
between two or more polynucleotides, which most often are double 
stranded DNAs. Techniques for ligation are well known to the art 
and protocols for ligation are described in standard laboratory 
manuals and references, such as , for instance, Sambrook et al., 
MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Bd. ; Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1969) and Maniatis 
- et al., pg. 146, as cited below. 

OLIGONUCLEOTIDE (S) refers to relatively short polynucleotides. 
Often the term refers to single-stranded deoxyribonucleotides, but 
it can refer as well to single -or double- stranded ribonucleotides, 
RNA:DNA hybrids and double -stranded DMAs, among others. 

Oligonucleotides, such as single -stranded DKA probe 
oligonucleotides, often are synthesized by chemical methods, such 
as those implemented on automated oligonucleotide synthesizers. 
However, oligonucleotides can be made by a variety of other 
methods, including in vitro recombinant DKA-mediated techniques and 
by expression of DNAs in cells and organisms. 

Initially, chemically synthesized DNAs typically are obtained 
without a S' phosphate. The 5' ends of such oligonucleotides are 
not substrates for phosphodieeter bond formation by ligation 
reactions that employ DNA ligases typically used to form 
recombinant DNA molecules . Where ligation of such oligonucleotides 
is desired, a phosphate can be added by standard techniques, such 
as those that employ a kinase and ATP. 

The 3' end of a chemically synthesized oligonucleotide 
generally has a free hydroxyl group and, in the presence of a 
ligase, such as T4 DNA ligase, readily will form a phosphodiester 
bond with a 5' phosphate of another polynucleotide, such as another 
oligonucleotide. As is well known, this reaction can be prevented 
selectively, where desired, by removing the 5' phosphates of the 
other polynucleotide (s) prior to ligation. 

PLASMIDS generally are designated herein by a lower case p 
preceded and/or followed by capital letters and/or numbers, in 
accordance with standard naming conventions that are familiar to 
those of skill in the art. 

Starting plasmids disclosed herein are either commercially 
available, publicly available on an unrestricted basis, or can be 
constructed from available plasmids by routine application of well 
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known, published procedures. Many plasmids and other cloning and 
expression vectors that can be used in accordance with the present 
invention are well known and readily available to those of skill 
in the art. Moreover, those of skill readily may construct any 
number of other plasmids suitable for use in the invention. The 
properties, construction and use of such plasmids, as well as other 
vectors, in the present invention will be readily apparent to those 
of skill from the present disclosure. 

POLYNUCLEOTIDE ( S ) generally refers to any polyribonucleotide 
or polydeoxribonucleotide, which may be unmodified RNA or DNA or 
modified RNA or DNA. Thus, for instance, polynucleotides as used 
herein refers to, among others, single-and double -stranded DNA, DMA 
that is a mixture of single-and double -stranded regions, single- 
and double-stranded RNA, and RNA that is mixture of single- and 
double-stranded regions, hybrid molecules comprising DNA and RNA 
that may be single-stranded or, more typically, double -stranded or 
a mixture of single- and double-stranded regions. In addition, 
polynucleotide as used herein refers to triple -stranded regions 
comprising RNA or DNA or both RNA and DNA. The strands in such 
regions may be from the same molecule or from different molecules. 
The regions may include all of one or more of the molecules, but 
more typically involve only a region of some of the molecules, one 
of the molecules of a triple-helical region often is an 

oligonucleotide . 

As used herein, the term polynucleotide includes DNAs or RNAs 
as described above that contain one or more modified bases. Thus, 
DNAs or RNAs with backbones modified for stability or for other 
reasons are "polynucleotides" as that term is intended herein. 
Moreover, DNAs or RNAs comprising unusual bases, such as inosine, 
or modified bases, such as tritylated bases, to name just two 
examples, are polynucleotides as the term is used herein. 

It will be appreciated that a great variety of modifications 
have been made to DNA and RNA that serve many useful purposes known 
to those of skill in the art. The term polynucleotide as it is 
employed herein embraces such chemically, enzymatically or 
metabolically modified forms of polynucleotides, as well as the 
chemical forms of DNA and RNA characteristic of viruses and cells, 
including simple and complex cells, inter alia. 
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POLYPEPTIDES, as used herein, includes all polypeptides as 
described below. The basic structure of polypeptides is well known 
and has been described in innumerable textbooks and other 
publications in the art. In this context, the term is used herein 
to refer to any peptide or protein comprising two or more amino 
acids joined to each other in a linear chain by peptide bonds. As 
used herein, the term refers to both short chains, which also 
commonly are referred to in the art as peptides, oligopeptides and 
oligomers, for example, and to longer chains, which generally are 
referred to in the art as proteins, of which there are many types. 

It will be appreciated that polypeptides often contain amino 
acids other than the 20 amino acids commonly referred to as the 20 
naturally occurring amino acids, and that many amino acids, 
including the terminal amino acids, may be modified in a given 
polypeptide, either by natural processes, such as processing and 
other post-translational modifications, but also by chemical 
modification techniques which are well known to the art. Even the 
common modifications that occur naturally in polypeptides are too 
numerous to list exhaustively here, but they are well described in 
basic texts and in more detailed monographs, as well as in a 
voluminous research literature, and they are well known to those 
of skill in the art. Among the known modifications which may 

be present in polypeptides of the present are, to name an 
illustrative few, acetylation, acylation, ADP-ribosylation, 
amidation, covalent attachment of flavin, covalent attachment of 
a heme moiety, covalent attachment of a nucleotide or nucleotide 
derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphotidylinositol, cross -linking, 
cyclization, disulfide bond formation, demethylation, formation of 
covalent cross -links, formation of cystine, formation of 
pyroglutamate, formylation, gamma -carboxylat ion, glycosylation, GPI 
anchor formation, hydroxy lat ion, iodination, methylation, 
myristoylation, oxidation, proteolytic processing, phosphorylation, 
prenylation, racemization, selenoylation, sulfation, transfer-RNA 
mediated addition of amino acids to proteins such as arginylation, 
and ubiquitination. 

Such modifications are well known to those of skill and have 
been described in great detail in the scientific literature. 
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Several particularly common modifications, gly cosy lat ion , lipid 
attachment, sulfation, gamma -carboxylat ion of glutamic acid 
residues, hydroxylation and ADP-ribosylation, for instance, are 
described in most basic texts, such as, for instance PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES , 2nd Ed., T. B. Creighton, W. 
H. Freeman and Company, New York (1993) . Many detailed reviews are 
available on this subject, such as, for example, those provided by 
Wold, P.. Posttranslational Protein Modifications: Perspectives and 
Prospects, pge. 1-12 in POSTTRANSLATIONAL COVALENT MODIFICATION OF 
PROTEINS,' B. C. Johnson, Ed., Academic Press, New York (1983) ; 
Seifter et al., Analysis for protein modifications and nonprotein 
cofactors. Meth. Bnzymol. 182: 626-646 (1990) and Rattan et al., 
Protein Synthesis: Posttranslational Modifications and Aging, Ann. 
N.Y. Acad. Sci. 663: 48-62 (1992). 

It will be appreciated, as is well known and as noted above, 
that polypeptides are not always entirely linear. For instance, 
polypeptides may be branched as a result of ubiquitination, and 
they may be circular, with or without branching, generally as a 
result of posttranslation events, including natural processing 
event and events brought about by human manipulation which do not 
occur naturally. Circular, branched and branched circular 
polypeptides may be synthesized by non-translation natural process 
and by entirely synthetic methods, as well. 

Modifications can occur anywhere in a polypeptide, including 
the peptide backbone, the amino acid side-chains and the amino or 
carboxyl termini. In fact, blockage of the amino or carboxyl group 
in a polypeptide, or both, by a covalent modification, is common 
in naturally occurring and synthetic polypeptides and such 
modifications may be present in polypeptides of the present 
invention, as well. For instance, the amino terminal residue of 
polypeptides made in B. coli, prior to proteolytic processing, 
almost invariably will be N-formylmethionine. 

The modifications that occur in a polypeptide often will be 
a function of how it is made. For polypeptides made by expressing 
a cloned gene in a host, for instance, the nature and extent of the 
modifications in large part will be determined by the host cell 
posttranslational modification capacity and the modification 
signals present in the polypeptide amino acid sequence. For 
instance, as is well known, glycosylation often does not occur in 
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bacterial hosts such as E. coli. Accordingly, when glycosylation 
is desired, a polypeptide should be expressed in a glycosylating 
host, generally a eukaryotic cell. Insect cell often carry out the 
same posttranslational g lycos ylat ions as mammalian cells and, for 
this reason, insect cell expression systems have been developed to 
express efficiently mammalian proteins having native patterns of 
glycosylation, inter alia. Similar considerations apply to other 
modifications. 

It will be appreciated that the same type of modification may 
be present in the same or varying degree at several sites in a 
given polypeptide. Also, a given polypeptide may contain many 
types of modifications. 

In general, as used herein, the term polypeptide encompasses 
all such modifications, particularly those that are present in 
polypeptides synthesized by expressing a polynucleotide in a host 
cell. 

VARIANT (S) of polynucleotides or polypeptides, as the term is 
used herein, are polynucleotides or polypeptides that differ from 
a reference polynucleotide or polypeptide, respectively. Variants 
in this sense are described below and elsewhere in the present 
disclosure in greater detail. 

(1) A polynucleotide that differs in nucleotide sequence from 
another, rsference polynucleotide. Generally, differences are 
limited so that the nucleotide sequences of the reference and the 
variant are closely similar overall and, in many regions, 
identical . 

As noted below, changes in the nucleotide sequence of the 
variant may be silent^ That is, they may not alter the amino acids 
encoded by the polynucleotide. Where alterations are limited to 
silent changes of this type a variant will encode a polypeptide 
with the same amino acid sequence as the reference. Also as noted 
below, changes in the nucleotide sequence of the variant may alter 
the amino acid sequence of a polypeptide encoded by the reference 
polynucleotide. Such nucleotide changes may result in amino acid 
substitutions, additions, deletions, fusions and truncations in the 
polypeptide encoded by the reference sequence, as discussed below. 

(2) A polypeptide that differs in amino acid sequence from 
another, reference polypeptide. Generally, differences are limited 
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bo that the sequences of the reference and the variant are closely 
similar overall and, in many region, identical. 

A variant and reference polypeptide may differ in amino acid 
sequence by one or more substitutions, additions, deletions, 
fusions and truncations, which may be present in any combination. 

RECEPTOR MOLECULE, as used herein, refers to molecules which 
bind or interact specifically with hBSP I. II and III polypeptides 
of the present invention, including not only classic receptors, 
which are preferred, but also other molecules that specifically 
bind to or interact with polypeptides of the invention (which also 
nay be referred to as -binding molecules- and -interaction 
molecules,- respectively and as "hBSP I, II and III binding 
molecules- and "hESF I. II and III interaction molecules.- Binding 
between polypeptides of the invention and such molecules, including 
receptor or binding or interaction molecules may be exclusive to 
polypeptides of the invention, which is very highly preferred, or 
it may be highly specific for polypeptides of the invention, which 
is highly preferred, or it may be highly specific to a group of 
proteins that includes polypeptides of the invention, which is 
preferred, or it may be specific to several groups of proteins at 
least one of which includes polypeptides of the invention. 

Receptors also may be non-naturally occurring, such as 
antibodies and antibody -derived reagents that bind to polypeptides 
of the invention. 

DESCRIPTION OF THE INVENTION 
The present invention relates to novel hESF I. II and III 
polypeptides and polynucleotides, among other things, as described 
in greater detail below. In particular, the invention relates to 
polypeptides and polynucleotides of novel human hESF I, II and III, 
which are related by amino acid sequence homology to the rat 
prostatic steroid-binding protein. The invention relates 
especially to hESF I. II and III having the nucleotide and amino 
acid sequences set out in Figures 1, 2 and 3 (SEQ ID NO.1-6) and 
to the hESF I, II and III nucleotides and amino acid sequences of 
the human cDNAs in ATCC Deposit No. 97401, 97402 and 97403 which 
is herein referred to as -the deposited clone- or as the -cDNA of 
the deposited clone.- It will be appreciated that the nucleotide 
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and amino acid sequences set out in Figures 1, 2 and 3 (SBQ id 
NO; 2, 4 and 6) were obtained by sequencing the cDNA of the 
deposited clone. Hence, the sequence of the deposited clone is 
controlling as to any discrepancies between the two and any 
reference to the sequences of Figures l, 2 and 3 (SBQ ID N0:1, 3 
and 5) include reference to the sequence of the human cDNA of the 
deposited claim. 

Polynucleotides 

in accordance with one aspect of the present invention, there 
are provided isolated polynucleotides which encode hBSP I, II and 
III polypeptides having the deduced amino acid sequences of Figures 
1, 2 and 3 (SBQ ID NO: 2, 4 and 6). 

Using the information provided herein, such as the 
polynucleotide sequences set out in Figures 1, 2 and 3 (SBQ ID 
N0:1, 3 and S) a polynucleotide of the present invention encoding 
human hBSF I, II and III polypep tided may be obtained using 
standard cloning and screening procedures, such as those for 
cloning cDNAs using mRKA from cells of a human endometrial tumor 
as starting material. Illustrative of the invention, the 
polynucleotide set out in Figure 1 (SBQ ID N0:1) was discovered in 
a cDNA library derived from cells of a human endometrial tumor. 
The polynucleotide of Figure 2 (SBQ ID NO: 3) was discovered in a 
cDNA library derived from cyclohexamide treated CBM cells. The 
polynucleotide of Pigure 3 (SBQ ID NO: 5) was discovered in a cDNA 
library derived from human endometrial tumor. 

Human hBSF I of the invention is structurally related to other 
proteins of the Clara cell secretory protein family, as shown by 
the results of sequencing the cDNA encoding human hBSF I in the 
deposited clone. The cDNA sequence thus obtained is set out in 
Figure 1 (SBQ ID NO:i) . It contains an open reading frame encoding 
a protein of about 90 amino acid residues, wherein the initial 21 
amino acid residues represent a putative leader sequence, with a 
deduced molecular weight of the full-length protein of about 9.8 
kDa. The protein exhibits greatest homology to the rat prostatic 
steroid-binding protein, among known proteins. The protein hBSF 
I has about 46.067% identity and about 66.3% similarity with the 
amino acid sequence of the rat prostatic steroid-binding protein. 
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Human hBSP II contains an open reading frame encoding a 
protein of about 90 amino acid residues, wherein the initial 21 
amino acid residues represent a putative leader sequence, with a 
deduced molecular weight of the full-length protein of about 9.9 
kDa. The protein exhibits greatest homology to the rat prostatic 
steroid-binding protein, among known proteins. The protein hBSP 

II has about 49.438% identity and about 71.910% similarity with the 
amino acid sequence of rat prostatic steroid-binding protein C2. 

Human hBSP III contains an open reading frame encoding a 
protein of about 95 amino acid residues, wherein the initial 21 
amino acid residues represent a putative leader sequence, with a 
deduced molecular weight of the full-length protein of about 8.10 
kDa. The protein exhibits greatest homology to rat prostatic 
steroid-binding protein C3, among known proteins. The protein hBSP 

III has about 36.2% identity and about 64.9% similarity with the 
amino acid sequence of the rat prostatic steroid-binding protein 

03 ' Polynucleotides of the present invention may be in the form 
of RNA such as cnSNA, or in the form of DMA, including, for 
instance, cDNA and genomic DNA obtained by cloning or produced by 
chemical synthetic techniques or by a combination thereof. The DNA 
may be double-stranded or single-stranded. Single-stranded DNA may 
be the coding strand, also known as the sense strand, or it may be 
the non-coding strand, also referred to as the anti-sense strand. 

The coding sequence -hich encodes the polypeptides may be 
identical to the coding sequence of the polynucleotides shown in 
Pigures 1. 2 and 3 (SEQ ID NO:l. 3 and 5) . It also may be a 
polynucleotide with a different sequence, which, as a result of the 
redundancy (degeneracy) of the genetic code, encodes the 
polypeptides of the human cDNA of Figures 1, 2 and 3 (SBQ NO:1 - 

3 ^Polynucleotides of the present invention which encode the 
polypeptides of Figures 1. 2 and 3 <SBQ ID RO.l. 3 and 5> may 
include, but are not limited to the coding sequence for the mature 
polypeptide, by itself; the coding sequence for the mature 
polypeptide and additional coding sequences, such as those encoding 
a leader or secretory sequence, such as a pre-, or pro- or prepro- 
protein sequence.- the coding sequence of the mature polypeptide. 
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with or without the aforementioned additional coding sequences, 
together with additional, non-coding sequences, including for 
example, but not limited to introns and non-coding 5* and 3' 
sequences, such as the transcribed, non -translated sequences that 
play a role in transcription, mRNA processing - including splicing 
and polyadenylation signals, for example - ribosome binding and 
stability of mRNA; additional coding sequence which codes for 
additional amino acids, such as those which provide additional 
functionalities. Thus, for instance, the polypeptide may be fused 
to a marker sequence, such as a peptide, which facilitates 
purification of the fused polypeptide. In certain preferred 
embodiments of this aspect of the invention, the marker sequence 
is a hexa-histidine peptide, such as the tag provided in the pQB 
vector (Qiagen, Inc., among others, many of which are commercially 
available. As described in Gentz et al., Proc. Natl. Acad. Sci., 
USA 86: 621-624 (1989), for instance, hexa-histidine provides for 
convenient purification of the fusion protein. The HA tag 
corresponds to an epitope derived of influenza hemagglutinin 
protein, which has been described by Wilson et al.. Cell 37: 767 
(1984), for instance. 

In accordance with the foregoing, the term "polynucleotide 
encoding a polypeptide" as used herein encompasses polynucleotides 
which include a sequence encoding a polypeptide of the present 
invention, particularly human hBSP I, II and III having the amino 
acid sequences set out in Figures 1, 2 and 3 (SBQ ID NO: 2, 4 and 
6) . The term encompasses polynucleotides that include a single 
continuous region or discontinuous regions encoding the polypeptide 
(for example, interrupted by introns) together with additional 
regions, that also may contain coding and/or non-coding sequences. 

The present invention further relates to variants of the 
herein above described polynucleotides which encode for fragments, 
analogs and derivatives of the polypeptides having the deduced 
amino acid sequences of Figures l, 2 and 3 (SEQ ID NO: 2, 4 and 6) . 
A variant of the polynucleotide may be a naturally occurring 
variant such as a naturally occurring allelic variant, or it may 
be a variant that is not known to occur naturally. Such non- 
naturally occurring variants of the polynucleotide may be made by 
mutagenesis techniques, including those applied to polynucleotides, 
cells or organisms. 
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Among variants in this regard are variants that differ from 
the aforementioned polynucleotides by nucleotide substitutions, 
deletions or additions. The substitutions, deletions or additions 
may involve one or more nucleotides. The variants may be altered 
in coding or non-coding regions or both. Alterations in the coding 
regions may produce conservative or non-conservative amino acid 
substitutions, deletions or additions. 

Among the particularly preferred embodiments of the invention 
in this regard are polynucleotides encoding polypeptides having the 
amino acid sequence of hBSP I, II and III set out in Figures 1, 2 
and 3 (SBQ ID NO: 2, 4 and 6) ; variants, analogs, derivatives and 
fragments thereof, and fragments of the variants, analogs and 
derivatives . 

Further particularly preferred in this regard are 
polynucleotides encoding hBSF I, II and III variants, analogs, 
derivatives and fragments, and variants, analogs and derivatives 
of the fragments, which have the amino add sequence of the hBSF 
I II or III polypeptides of Figures 1. 2 and 3 (SBQ ID S0:2. 4 and 
6 i in which several, a few. 5 to 10. 1 to 5. 1 to 3, 2, 1 or no 
amino acid residues are substituted, deleted or added, in any 
combination. Bepecially preferred among these are sxlent 
substitutions, additions and deletions, which do not alter the 
properties and activities of the hBSF I. II and III. Also 
especially preferred in this regard are conservative substitutions. 
Most highly preferred are polynucleotides encoding polypeptides 
having the amino acid sequences of Figures 1. 2 and 3 (SBQ ID NO 
4 and 6). without substitutions. 

Further preferred embodiments of the invention are 
polynucleotides that are at least 70% identical to a 
polynucleotides encoding the hBSF I. II and III P 01 ^"*"**™* 
the amino acid sequence, set out in Figures !. 2 and 3 (SBQ ID 
NO-2 4 and 6) . and polynucleotides which are complementary to such 
polynucleotides. Alternatively, most highly preferred are 
polynucleotides that comprise a region that is at least 80* 
identical to a polynucleotide encoding the hBSF I II or III 
polypeptides of the cDNA of the deposited clone and polynucleotides 
complementary thereto. In this regard, polynucleotides at least 
90% identical to the same are particularly preferred, and among 
these particularly preferred polynucleotides, those with at least 
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95% are especially preferred. Furthermore, those with at least 97% 
are highly preferred among those with at least 95%, and among these 
those with at least 98% and at least 99% are particularly highly 
preferred, with at least 99% being the more preferred. 

Particularly preferred embodiments in this respect, moreover, 
are polynucleotides which encode polypeptides which retain 
substantially the same biological function or activity as the 
mature polypeptide encoded by the human cDNA of Figures l, 2 and 
3 (SBQ ID N0:1, 3 and 5) . 

The present invention further relates to polynucleotides that 
hybridize to the herein above -described sequences. In this regard, 
the present invention especially relates to polynucleotides which 
hybridize under stringent conditions to the herein above -de scribed 
polynucleotides. As herein used, the term "stringent conditions" 
means hybridization will occur only if there is at least 95% and 
preferably at least 97% identity between the sequences. 

As discussed additionally herein regarding polynucleotide 
assays of the invention, for instance, polynucleotides of the 
invention as discussed above, may be used as a hybridization probe 
for cDNA and genomic DNA to isolate full-length cDNAs and genomic 
clones encoding hESF I, II or III and to isolate cDNA and genomic 
clones of other genes that have a high sequence similarity to the 
human hESF I, II or III genes. Such probes generally will comprise 
at least 15 bases. Preferably, such probes will have at least 30 
bases and may have at least 50 bases. 

For example, the coding region of the hESF I, II and III genes 
may be isolated by screening using the known DNA sequence to 
synthesize an oligonucleotide probe. A labeled oligonucleotide 
having a sequence complementary to that of a gene of the present 
invention is then used to screen a library of human cDNA, genomic 
DNA or mRNA to determine which members of the library the probe 
hybridizes to. 

The polynucleotides and polypeptides of the present invention 
may be employed as research reagents and materials for discovery 
of treatments and diagnostics to human disease, as further 
discussed herein relating to polynucleotide assays, inter alia. 

The polynucleotides may encode a polypeptide which is the 
mature protein plus additional amino or carboxyl- terminal amino 
acids, or amino acids interior to the mature polypeptide (when the 
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mature form has more than one polypeptide chain, for instance) . 
Such sequences may play a role in processing of a protein from 
precursor to a mature form, may facilitate protein trafficking, may 
prolong or shorten protein half -life or may facilitate manipulation 
of a protein for assay or production, among other things. As 
generally is the case in situ, the additional amino acids may be 
processed away from the mature protein by cellular enzymes. 

A precursor protein, having the mature form of the polypeptide 
fused to one or more prosequences may be an inactive form of the 
polypeptide. When prosequences are removed such inactive 
precursors generally are activated. Some or all of the 
prosequences may be removed before activation. Generally, such 
precursors are called proproteins. 

In sum, a polynucleotide of the present invention may encode 
a mature protein, a mature protein plus a leader sequence (which 
may be referred to as a preprotein) , a precursor of a mature 
protein having one or more prosequences which are not the leader 
sequences of a preprotein. or a preproprotein, which is a precursor 
to a proprotein, having a leader sequence and one or more 
prosequences, which generally are removed during processing steps 
that produce active and mature forms of the polypeptide. 

Deposited materials 

A deposit containing a human hESP I, II and III cDNA has been 
deposited with the American Type Culture Collection, as noted 
above. Also as noted above, the cDNA deposit is referred to herein 
as "the deposited clone" or as "the cDNA of the deposited clone." 

The deposited clone was deposited with the American Type 
Culture Collection, 12301 Park Lawn Drive, Rocfcville, Maryland 
20852, USA, on January 2, 1996 and assigned ATCC Deposit No. 97401, 

97402 and 97403. 

The deposited materials are pBluescript SK (-) plasmids 
(Stratagene, La Jolla. CA) containing the full length hBSP I. II 
and III cDNA. 

The deposit has been made under the terms of the Budapest 
Treaty on the international recognition of the deposit of micro- 
organisms for purposes of patent procedure. The strain will be 
irrevocably and without restriction or condition released to the 
public upon the issuance of a patent. The deposit is provided 
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merely as convenience to chose of skill in the art and is not an 
acini sb ion that a deposit is required for enablement, such as that 
required under 35 U.5.C. §112. 

The sequence of the polynucleotides contained in the deposited 
material, as well as the amino acid sequence of the polypeptide 
encoded thereby, are controlling in the event of any conflict with 
any description of sequences herein. 

A license may be required to maJce, use or sell the deposited 
materials, and no such license is hereby granted. 

Polypeptides 

The present invention further relates to a human hBSP I, II 
and III polypeptide which has the deduced amino acid sequence of 
Figures 1, 2 and 3 (SBQ ID NO: 2, 4 and 6) . 

The invention also relates to fragments, analogs and 
derivatives of these polypeptides. The terms "fragment," 
"derivative" and "analog" when referring to the polypeptide of 
Figures 1, 2 and 3 (SBQ ID NO: 2, 4 and €) means a polypeptide which 
retains essentially the same biological function or activity as 
such polypeptide. Thus, an analog includes a proprotein which can 
be activated by cleavage of the proprotein portion to produce an 
active mature polypeptide. 

The polypeptide of the present invention may be a recombinant 
polypeptide, a natural polypeptide or a synthetic polypeptide. In 
certain preferred embodiments it is a recombinant polypeptide. 

The fragment, derivative or analog of the polypeptide of 
Figures 1, 2 and 3 (SBQ ID NO: 2, 4 and 6) may be (i) one in which 
one or more of the amino acid residues are substituted with a 
conserved or non -conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid 
residue may or may not be one encoded by the genetic code, or (ii) 
one in which one or more of the amino acid residues includes a 
substituent group, or (iii) one in which the mature polypeptide is 
fused with another compound, such as a compound to increase the 
half -life of the polypeptide {for example, polyethylene glycol), 
or (iv> one in which the additional amino acids are fused to the 
mature polypeptide, such as a leader or secretory sequence or a 
sequence which is employed for purification of the mature 
polypeptide or a proprotein sequence. Such fragments, derivatives 
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and analogs are deemed to be within the scope of those skilled in 
the art from the teachings herein. 

Among preferred variants are those that vary from a reference 
by conservative amino acid substitutions. Such substitutions are 
those that substitute a given amino acid in a polypeptide by 
another amino acid of like characteristics. Typically seen as 
conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu and He; interchange 
of the hydroxyl residues Ser and Thr, exchange of the acidic 
residues Asp and Glu, substitution between the amide residues Asn 
and Gin, exchange of the basic residues Lys and Arg and 
replacements among the aromatic residues Phe, Tyr. 

The polypeptides and polynucleotides of the present invention 
are preferably provided in an isolated form, and preferably are 
purified to homogeneity. 

The polypeptides of the present invention include the 
polypeptide of SBQ ID NO: 2 (in particular the mature polypeptide) 
as well as polypeptides which have at least 75% similarity 
(preferably at least 75% identity) to the polypeptide of SBQ ID 
NO- 2 and more preferably at least 90% similarity (more preferably 
at least 90% identity) to the polypeptide of SBQ ID NO:2 and still 
more preferably at least 95% similarity (still more preferably at 
least 95% identity) to the polypeptide of SBQ ID NO: 2 and also 
include portions of such polypeptides with such portion of the 
polypeptide generally containing at least 30 amino acids and more 
preferably at least 50 amino acids. 

As known in the art -similarity" between two polypeptides is 
determined by comparing the amino acid sequence and its conserved 
amino acid substitutes of one polypeptide to the sequence of a 

second polypeptide. 

Fragments or portions of the polypeptides or the present 
invention may be employed for producing the corresponding full- 
length polypeptide by peptide synthesis; therefore, the fragments 
may be employed as intermediates for producing the full-length 
polypeptides. Fragments or portions of the polynucleotides of the 
present invention may be used to synthesize full-length 
polynucleotides of the present invention. 



Pragments 
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Also among preferred embodiments of this aspect of the present 
invention are polypeptides comprising fragments of hESF I, XI and 
III, most particularly fragments of the hESF I, II and III having 
the amino acid sequence set out in Pigures 1, 2 and a (SBQ ID NO: 2, 
4 and 6) , and fragments of variants and derivatives of the hJZSF I, 
II and III of Figures 1, 2 and 3 (SBQ ID N0:2, 4 and 6). 

In this regard a fragment is a polypeptide having an amino 
acid sequence that entirely is the same as part but not all of the 
amino acid sequence of the aforementioned hESF I, II and III 
polypeptides and variants or derivatives thereof. 

Such fragments may be "free-standing," i.e., not part of or 
fused to other amino acids or polypeptides, or they may be 
comprised within a larger polypeptide of which they form a part or 
region. When comprised within a larger polypeptide, the presently 
discussed fragments most preferably form a single continuous 
region. However, several fragments may be comprised within a 
single larger polypeptide. For instance, certain preferred 
embodiments relate to a fragment of a hESF I, II or III polypeptide 
of the present comprised within a precursor polypeptide designed 
for expression in a host and having heterologous pre and pro- 
polypeptide regions fused to the amino terminus of the hESF I, II 
and III fragment and an additional region fused to the carboxyl 
terminus of the fragment. Therefore, fragments in one aspect of 
the meaning intended herein, refers to the portion or portions of 
a fusion polypeptide or fusion protein derived from hESF I, II and 
III. 

As representative examples of polypeptide fragments of the 
invention, there may be mentioned those which have from about 15 
to about 139 amino acids. 

In this context about includes the particularly recited range 
and ranges larger or smaller by several, a few, 5, 4, 3, 2 or 1 
amino acid at either extreme or at both extremes. Highly 
preferred in this regard are the recited ranges plus or minus as 
many as 5 amino acids at either or at both extremes. Particularly 
highly preferred are the recited ranges plus or minus as many as 
3 amino acids at either or at both the recited extremes. 
Especially preferred are ranges plus or minus l amino acid at 
either or at both extremes or the recited ranges with no additions 
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or deletions. Host highly preferred of all in this regard are 
fragments from about 15 to about 45 amino acids. 

Among especially preferred fragments of the invention are 
truncation mutants of hBSP I. II and III. Truncation mutants 
include hBSP I. II and III polypeptides having the amino acid 
sequence of Figures 1. 2 and 3 (6BQ ID N0:2. 4 and 6) . or variants 
or derivatives thereof, except for deletion of a continuous series 
of residues (that is, a continuous region, part or portion) that 
includes the amino terminus, or a continuous series of residues 
that includes the carboxyl terminus or. as in double truncation 
mutants, deletion of two continuous series of residues, one 
including the amino terminus and one including the carboxyl 
terminus. Fragments having the size ranges set out about also are 
preferred embodiments of truncation fragments, which are especially 
preferred among fragments generally. 

Also preferred in this aspect of the invention are fragments 
characterized by structural or functional attributes of hBSP I. II 
and III Preferred embodiments of the invention in this regard 
include fragments that comprise alpha-helix and alpha-helix forming 
regions ("alpha-regions-), beta-sheet and beta-sheet-forming 
regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), 
hydrophilic regions, hydrophobic regions, alpha amphipathic 
regionB. beta amphipathic regions, flexible regions, surface- 
forming regions and high antigenic index regions of hBSF I. II and 

Certain preferred regions in these regards are set out in 
Figures 4. 5 and 6 and include, but are not limited to, regions of 
the aforementioned types identified by analysis of the amino acid 
sequence set out in Figures 1. 2 and 3 (SBQ ID H0:2, 4 and 6) . As 
set out in Figures 4, 5 and 6 such preferred regions include 
Garnier-Robson alpha -regions, beta-regions, turn-regions and coil- 
regions. Chou-Fasman alpha-regions, beta-regions and turn-regions, 
Kyte-Doolittle hydrophilic regions and hydrophilic regions, 
Bisenberg alpha and beta amphipathic regions. Karplus-Schulz 
flexible regions. Bmini surface-forming regions and Jameson-wolf 
high antigenic index regions. 

Among highly preferred fragments in this regard are those that 
comprise regions of hBSF I. II and III that combine several 
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structural features, such as several of the features set out above. 
In this regard, the regions defined by the residues of Figures l, 
2 and 3 (SBQ ID NO: 2* 4 and 6), which all are characterized by 
amino acid compositions highly characteristic of turn-regions, 
hydrophilic regions, flexible -regions, surf ace -forming regions, and 
high antigenic index- regions, are especially highly preferred 
regions. Such regions may be comprised within a larger polypeptide 
or may be by themselves a preferred fragment of the present 
invention, as discussed above. It will be appreciated that the 
term "about" as used in this paragraph has the meaning set out 
above regarding fragments in general. 

Further preferred regions are those that mediate activities 
of hBSF I, II and III. Most highly preferred in this regard are 
fragments that have a chemical, biological or other activity of 
hBSF I, II and III, including those with a similar activity or an 
improved activity, or with a decreased undesirable activity. 
Highly preferred in this regard are fragments that contain regions 
that are horaologs in sequence, or in position, or in both sequence 
and to active regions of related polypeptides, such as the related 
polypeptides set out in Figures 4, 5 and 6 (SBQ ID NO: 2, 4 and 6) 
and which include rat prostatic specif ic- binding proteins. Among 
particularly preferred fragments in these regards are truncation 
mutants, as discussed above. 

It will be appreciated that the invention also relates to, 
among others, polynucleotides encoding the aforementioned 
fragments, polynucleotides that hybridize to polynucleotides 
encoding the fragments, particularly those that hybridize under 
stringent conditions, and polynucleotides, such as PCR primers, for 
amplifying polynucleotides that encode the fragments. In these 
regards, preferred polynucleotides are those that correspondent to 
the preferred fragments, as discussed above. 

Vectors, host cells, expression 

The present invention also relates to vectors which include 
polynucleotides of the present invention, host cells which are 
genetically engineered with vectors of the invention and the 
production of polypeptides of the invention by recombinant 
techniques . 
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Host cells can be genetically engineered to incorporate 
polynucleotides and express polypeptides of the present invention. 
For instance, polynucleotides nay be introduced into host cells 
using well known techniques of infection, transduction, 
tranaf action, transvection and transformation. The polynucleotides 
may be introduced alone or with other polynucleotides. Such other 
polynucleotides way be introduced Independently, co- introduced or 
introduced joined to the polynucleotides of the invention. 

Thus, for instance, polynucleotides of the invention may be 
transfected into host cells with another, separate, polynucleotide 
encoding a selectable marker, using standard techniques for co- 
transfection and selection in. for instance, mammalian cells. In 
this case the polynucleotides generally will be stably incorporated 
into the host cell genome. 

Alternatively, the polynucleotides may be joined to a vector 
containing a selectable marker for propagation in a host. The 
vector construct may be introduced into host cells by the 
aforementioned techniques. Generally, a plasmid vector is 
introduced as DHA in a precipitate, such as a calcium phosphate 
precipitate, or in a complex with a charged lipid. Blectroporation 
also may be used to introduce polynucleotides into a host. If the 
vector is a virus, it may be packaged in vitro or introduced into 
a packaging cell and the packaged virus may be transduced into 
cells A wide variety of techniques suitable for making 
polynucleotides and for introducing polynucleotides into cells in 
accordance with this aspect of the invention are well known and 
routine to those of skill in the art. Such techniques are reviewed 
at length in Sambrook et al. cited above, which is illustrative of 
the many laboratory manuals that detail these techniques. I n 
accordance with this aspect of the invention the vector may be, for 
example, a plasmid vector, a single or double-stranded phage 
vector, a single or double -stranded RNA or DMA viral vector. Such 
vectors may be introduced into cells as polynucleotides, preferably 
DHA by well known techniques for introducing DNA and RHA into 
cells. The vectors, in the case of phage and viral vectors also 
may be and preferably are introduced into cells as packaged or 
encapsulated virus by well known techniques for infection and 
transduction. Viral vectors may be replication competent or 
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replication defective. In the latter case viral propagation 
generally will occur only in complementing host cells. 

Preferred among vectors, in certain respects, are those for 
expression of polynucleotides and polypeptides of the present 
invention. Generally, such vectors comprise cis -acting control 
regions effective for expression in a host opera tively linked to 
the polynucleotide to be expressed. Appropriate trans -acting 
factors either are supplied by the host, supplied by a 
complementing vector or supplied by the vector itself upon 
introduction into the host. 

In certain preferred embodiments in this regard, the vectors 
provide for specific expression. Such specific expression may be 
inducible expression or expression only in certain types of cells 
or both inducible and cell -specif ic. Particularly preferred among 
inducible vectors are vectors that can be induced for expression 
by environmental factors that are easy to manipulate, such as 
temperature and nutrient additives. A variety of vectors suitable 
to this aspect of the invention, including constitutive and 
inducible expression vectors for use in prokaryotic and eukaryotic 
hosts, are well known and employed routinely by those of skill in 
the art. 

The engineered host cells can be cultured in conventional 
nutrient media, which may be modified as appropriate for, inter 
alia, activating promoters, selecting transformants or amplifying 
genes. Culture conditions, such as temperature, pH and the like, 
previously used with the host cell selected for expression 
generally will be suitable for expression of polypeptides of the 
present invention as will be apparent to those of skill in the art. 

A great variety of expression vectors can be used to express 
a polypeptide of the invention. Such vectors include chromosomal, 
episomal and virus -derived vectors e.g., vectors derived from 
bacterial plasmids, from bacteriophage, from yeast episomes, from 
yeast chromosomal elements, from viruses such as baculoviruses , 
papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl 
pox viruses, pseudorabies viruses and retroviruses, and vectors 
derived from combinations thereof, such as those derived from 
plasmid and bacteriophage genetic elements, such as cosmids and 
phagemids, all may be used for expression in accordance with this 
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aspect of the present invention. Generally, any vector suitable 
to maintain, propagate or express polynucleotides to express a 
polypeptide in a host may be used for expression in this regard. 

The appropriate DNA sequence may be inserted into the vector 
by any of a variety of well-known and routine techniques. In 
general, a DNA sequence for expression is joined to an expression 
vector by cleaving the DNA sequence and the expression vector with 
one or more restriction endonucleases and then joining the 
restriction fragments together using T4 DNA ligase. Procedures for 
restriction and ligation that can be used to this end are well 
known and routine to those of skill. Suitable procedures in this 
regard, and for constructing expression vectors using alternative 
techniques, which also are well known and routine to those skill, 
are set forth in great detail in Sambrook et al. cited elsewhere 
herein. 

The DNA sequence in the expression vector is operatively 
linked to appropriate expression control sequence (s), including, 
for instance. a promoter to direct mRNA transcription. 
Representatives of such promoters include the phage lambda PL 
promoter, the B. coli lac, trp and tac promoters, the SV40 early 
and late promoters and promoters of retroviral LTRs, to name just 
a few of the well-known promoters. It will be understood that 
numerous promoters not mentioned are suitable for use in this 
aspect of the invention are well known and readily may be employed 
by those of skill in the manner illustrated by the discussion and 
the examples herein. 

In general, expression constructs will contain sites for 
transcription initiation and termination, and, in the transcribed 
region, a ribosome binding site for translation. The coding 
portion of the mature transcripts expressed by the constructs will 
include a translation initiating AOG at the beginning and a 
termination codon appropriately posit .ned at the end of the 
polypeptide to be translated. 

In addition, the constructs may contain control regions that 
regulate as well as engender expression. Generally, in accordance 
with many commonly practiced procedures, such regions will operate 
by controlling transcription, such as repressor binding sites and 
enhancers, among others. 
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Vectors for propagation and expression generally will include 
selectable markers. Such markers also may be suitable for 
amplification or the vectors may contain additional markers for 
this purpose. In this regard, the expression vectors preferably 
contain one or more selectable marker genes to provide a phenotypic 
trait for selection of transformed host cells. Preferred markers 
include dihydrof olate reductase or neomycin resistance for 
eukaryotic cell culture, and tetracycline or ampicillin resistance 
genes for culturing E, coli and other bacteria. 

The vector containing the appropriate DNA sequence as 
described elsewhere herein, as well as an appropriate promoter , and 
other appropriate control sequences, may be introduced into an 
appropriate host using a variety of well known techniques suitable 
to expression therein of a desired polypeptide. Representative 
examples of appropriate hosts include bacterial cells, such as B. 
coli, Streptomyces and Salmonella typhimurium cells; fungal cells, 
such as yeast cells; insect cells such as Drosophila S2 and 
Spodoptera Sf9 cells; animal cells such as CHO, COS and Bowes 
melanoma cells; and plant cells. Hosts for of a great variety of 
expression constructs are well known, and those of skill will be 
enabled by the present disclosure readily to select a host for 
expressing a polypeptides in accordance with this aspect of the 
present invention. 

More particularly, the present invention also includes 
recombinant constructs, such as expression constructs, comprising 
one or more of the sequences described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which 
such a sequence of the invention has been inserted. The sequence 
may be inserted in a forward or reverse orientation. In certain 
preferred embodiments in this regard, the construct further 
comprises regulatory sequences, including, for example, a promoter, 
operably linked to the sequence. Large numbers of suitable 
vectors and promoters are known to those of skill in the art, and 
there are many commercially available vectors suitable for use in 
the present invention. 

The following vectors, which are commercially available, are 
provided by way of example. Among vectors preferred for use in 
bacteria are pQE70, pQB60 and pQE-9, available from Qiagen; pBS 
vectors, Phagescript vectors, Bluescript vectors, pNHBA, pNH16a, 
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pNHlBA, pNH46A, available from Stratagene; and ptrc99a, pKK223-3, 
pKK233-3, pDR540, pRITS available from Pharmacia. Among preferred 
eukaryotic vectors are pWLNBO, pSV2CAT, pOG44 f pXTl and pSG 
available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available 
from Pharmacia. These vectors are listed solely by way of 
illustration of the many commercially available and well known 
vectors that are available to those of skill in the art for use in 
accordance with this aspect of the present invention. It will be 
appreciated that any other plasmid or vector suitable for, for 
example, introduction, maintenance, propagation or expression of 
a polynucleotide or polypeptide of the invention in a host may be 
used in this aspect of the invention. 

Promoter regions can be selected from any desired gene using 
vectors that contain a reporter transcription unit lacking a 
promoter region, such as a chloramphenicol acetyl transferase 
("cat") transcription unit, downstream of restriction site or sites 
for introducing a candidate promoter fragment; i.e., a fragment 
that may contain a promoter. As is well known, introduction into 
the vector of a promoter -containing fragment at the restriction 
site upstream of the cat gene engenders production of CAT 
activity, which can be detected by standard CAT assays. Vectors 
suitable to this end are well known and readily available. Two 
such vectors are pKK232-8 and pO*7. Thus, promoters for expression 
of polynucleotides of the present invention include not only well 
known and readily available promoters, but also promoters that 
readily may be obtained by the foregoing technique, using a 
reporter gene. 

Among known bacterial promoters suitable for expression of 
polynucleotides and polypeptides in accordance with the present 
invention are the B. coli lad and lacZ and promoters, the T3 and 
T7 promoters, the gpt promoter, the lambda PR. PL promoters and the 
trp promoter. Among known eukaryotic promoters suitable in 

this regard are the OW immediate early promoter, the HSV thymidine 
kinase promoter, the early and late SV40 promoters, the promoters 
of retroviral LTRs , such as those of the Rous sarcoma virus 
("RSV"), and metallothionein promoters, such as the mouse 
metallothionein-I promoter. 

Selection of appropriate vectors and promoters for expression 
in a host cell is a well known procedure and the requisite 
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techniques for expression vector construction, introduction of the 
vector into the host and expression in the host are routine skills 
in the art. 

The present invention also relates to host cells containing 
the above-described constructs discussed above. The host cell can 
be a higher eukaryotic cell, such as a mammalian cell, or a lower 
eukaryotic cell, such as a yeast cell, or the host cell can be a 
prokaryotic cell, such as a bacterial cell. 

Introduction of the construct into the host cell can be 
effected by calcium phosphate trans f ection, DBAB-dextran mediated 
transf ection, cationic lipid-mediated transf ection, 
electroporation, transduction, infection or other methods. Such 
methods are described in many standard laboratory manuals, such as 
Davis et al. BASIC METHODS IN MOLECULAR BIOLOGY, (1966). 

Constructs in host cells can be used in a conventional manner 
to produce the gene product encoded by the recombinant sequence. 
Alternatively, the polypeptides of the invention can be 
synthetically produced by conventional peptide synthesizers. 

Mature proteins can be expressed in mammalian cells, yeast, 
bacteria, or other cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs 
of the present invention. Appropriate cloning and expression 
vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd 
Bd., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 
(1989) . 

Generally, recombinant expression vectors will include origins 
of replication, a promoter derived from a highly -expressed gene to 
direct transcription of a downstream structural sequence, and a 
selectable marker to permit isolation of vector containing cells 
after exposure to the vector. Among suitable promoters are those 
derived from the genes that encode glycolytic enzymes such as 3- 
phosphoglycerate kinase ("PGR"), a-factor, acid phosphatase, and 
heat shock proteins, among others. Selectable markers include the 
ampicillin resistance gene of E . coli and the trpl gene of S. 
cerevisiae. 

Transcription of the DNA encoding the polypeptides of the 
present invention by higher eukaryotes may be increased by 
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inserting an enhancer sequence into the vector. Enhancers are cis- 
acting elements of DHA, usually about from 10 to 300 bp that act 
to increase transcriptional activity of a promoter in a given host 
cell-type. Examples of enhancers include the SV40 enhancer, which 
is located on the late side of the replication origin at bp 100 to 
270, the cytomegalovirus early promoter enhancer, the polyoma 
enhancer on the late side of the replication origin, and adenovirus 
enhancers . 

Polynucleotides of the invention, encoding the heterologous 
structural sequence of a polypeptide of the invention generally 
will be inserted into the vector using standard techniques so that 
it is operably linked to the promoter for expression. The 
polynucleotide will be positioned so that the transcription start 
site is located appropriately 5' to a ribosome binding site. The 
riboBome binding site will be 5' to the AUG that initiates 
translation of the polypeptide to be expressed. Generally, there 
will be no other open reading frames that begin with an initiation 
codon, usually AUG, and lie between the ribosome binding site and 
the initiating AUG. Also, generally, there will be a translation 
stop codon at the end of the polypeptide and there will be a 
polyadenylation signal and a transcription termination signal 
appropriately disposed at the 3* end of the transcribed region. 

For secretion of the translated protein into the lumen of the 
endoplasmic reticulum, into the periplasmic space or into the 
extracellular environment, appropriate secretion signals may be 
incorporated into the expressed polypeptide. The signals may be 
endogenous to the polypeptide or they may be heterologous signals. 

The polypeptide may be expressed in a modified form, such as 
a fusion protein, and may include not only secretion signals but 
also additional heterologous functional regions. Thus, for 
instance, a region of additional amino acids, particularly charged 
amino acids, may be added to the N-terminus of the polypeptide to 
improve stability and persistence in the host cell, during 
purification or during subsequent handling and storage. Also, 
region also may be added to the polypeptide to facilitate 
purification. Such regions may be removed prior to final 
preparation of the polypeptide. The addition of peptide moieties 
to polypeptides to engender secretion or excretion, to improve 
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stability and to facilitate purification, among others, are 
familiar and routine techniques in the art. 

Suitable prokaryotic hosts for propagation, maintenance or 
expression of polynucleotides and polypeptides in accordance with 
the invention include Bscherischia coli, Bacillus subtil is and 
Salmonella typhimurium. Various species of Pseudomonas, 
Streptomycee , and Staphylococcus are suitable hosts in this regard. 
Moreover, many other hosts also known to those of skill may be 
employed in this regard. 

As a representative but non- limiting example, useful 
expression vectors for bacterial use can comprise a selectable 
marker and bacterial origin of replication derived from 
commercially available plasma ds comprising genetic elements of the 
veil known cloning vector pBR322 {ATCC 37017) . Such commercial 
vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, 
Uppsala, Sweden) and GBM1 (Pr omega Biotec, Madison, HI, USA) . 
These pBR322 "backbone" sections are combined with an appropriate 
promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth 
of the host strain to an appropriate cell density, where the 
selected promoter is inducible it is induced by appropriate means 
(e.g., temperature shift or exposure to chemical inducer) and cells 
are cultured for an additional period. 

Cells typically then are harvested by centrif ugation, 
disrupted by physical or chemical means, and the resulting crude 
extract retained for further purification. 

Microbial cells employed in expression of proteins can be 
disrupted by any convenient method, including freeze- thaw cycling, 
sonication, mechanical disruption, or use of cell lysing agents, 
such methods are well know to those skilled in the art. 

Various mammalian cell culture systems can be employed for 
expression, as well. Examples of mammalian expression systems 
include the COS-7 lines of monkey kidney fibroblast, described in 
Gluzman et al.. Cell 23: 175 (1981). Other cell lines capable of 
expressing a compatible vector include for example, the C127, 3T3, 
CHO, HeLa, human kidney 293 and BHK cell lines. 

Mammalian expression vectors will comprise an origin of 
replication, a suitable promoter and enhancer, and also any 
necessary ribosome binding sites, polyadenylation sites, splice 
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donor and acceptor sites, transcriptional termination sequences, 
and 5' flanking non- transcribed sequences that are necessary for 
expression. In certain preferred embodiments in this regard DNA 
sequences derived from the SV40 splice sites, and the SV40 
polyadenylation sites are used for required non -transcribed genetic 
elements of these types. 

The hBSP I. II and III polypeptide can be recovered and 
purified from recombinant cell cultures by well-known methods 
including ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Most preferably, high 
performance liquid chromatography ("HPLC") is employed for 
purification. Well known techniques for refolding protein may be 
employed to regenerate active conformation when the polypeptide is 
denatured during isolation and or purification. 

Polypeptides of the present invention include naturally 
purified products, products of chemical synthetic procedures, and 
products produced by recombinant techniques from a prokaryotic or 
eukaryotic host, including, for example, bacterial, yeast, higher 
plant, insect and mammalian cells. Depending upon the host 
employed in a recombinant production procedure, the polypeptides 
of the present invention may be glycosylated or may be non- 
glycosylated. In addition, polypeptides of the invention may also 
include an initial modified methionine residue, in some caseB as 
a result of host -mediated processes. 

hBSP I. II and III polynucleotides and polypeptides may be 
used in accordance with the present invention for a variety of 
applications, particularly those that make use of the chemical and 
biological properties hBSP I. II and III. Additional applications 
relate to diagnosis and to treatment of disorders of cells, tissues 
and organisms. These aspects of the invention are illustrated 
further by the following discussion. 

Polynucleotide assays 

This invention is also related to the use of the hBSP I, II 
and III polynucleotides to detect complementary polynucleotides 
such as. for example, as a diagnostic reagent. Detection of a 
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mutated form of hBSP I, II and III associated with a dysfunction 
will provide a diagnostic tool that can add or define a diagnosis 
of a disease or susceptibility to a disease which results from 
under-expression, over -express ion or altered expression of hBSP I, 
II and III, such as, for example, a susceptibility to inherited 
asthma and endometrial cancer. 

Individuals carrying mutations in the human hBSP I, II and III 
gene may be detected at the DNA level by a variety of techniques. 
Nucleic acids for diagnosis may be obtained from a patient's cells, 
such as from blood, urine, saliva, tissue biopsy and autopsy 
material. The genomic DNA may be used directly for detection or 
may be amplified enzytnatically by using PCR prior to analysis. PGR 
(Saiki et al.. Nature, 324: 163-166 (1986)). RNA or cDNA may also 
be used in the same ways. As an example, PCR primers complementary 
to the nucleic acid encoding hBSP I, II and III can be used to 
identify and analyze hBSP I, II and III expression and mutations. 
Por example, deletions and insertions can be detected by a change 
in size of the amplified product in comparison to the normal 
genotype. Point mutations can be identified by hybridizing 
amplified DNA to radiolabeled hBSP I, II and III RNA or 
alternatively, radiolabeled hBSP I, II and III antisense DNA 
sequences. Perfectly matched sequences can be distinguished from 
mismatched duplexes by RNase A digestion or by differences in 
melting temperatures. 

Sequence differences between a reference gene and genes having 
mutations also may be revealed by direct DNA sequencing. In 
addition, cloned DNA segments may be employed as probes to detect 
specific DNA segments. The sensitivity of such methods can be 
greatly enhanced by appropriate use of PCR or another amplification 
method. Por example, a sequencing primer is used with double- 
stranded PCR product or a single-stranded template molecule 
generated by a modified PCR. The sequence determination is 
performed by conventional procedures with radiolabeled nucleotide 
or by automatic sequencing procedures with fluorescent -tags . 

Genetic testing based on DNA sequence differences may be 
achieved by detection of alteration in electrophoretic mobility of 
DNA fragments in gels, with or without denaturing agents. Small 
sequence deletions and insertions can be visualized by high 
resolution gel electrophoresis. DNA fragments of different 
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sequences may be distinguished on denaturing formamide gradient 
gels in which the mobilities of different DNA fragments are 
retarded in the gel at different positions according to their 
specific melting or partial melting temperatures (see, e.g., Myers 
et al., Science, 230: 1242 (1985)). 

Sequence changes at specific locations also may be revealed 
by nuclease protection assays, such as RNase and SI protection or 
the chemical cleavage method (e.g., Cotton et al., Proc. Natl. 
Acad. Sci.. USA, 85: 4397-4401 (1985)). 

Thus, the detection of a specific DNA sequence may be achieved 
by methods such as hybridization, RNase protection, chemical 
cleavage, direct DNA sequencing or the use of restriction enzymes, 

(e.g., restriction fragment length polymorphisms ("RFLP") and 
Southern blotting of genomic DNA. 

In addition to more conventional gel -electrophoresis and DNA 

sequencing, mutations also can be detected by in situ analysis. 

Chromosome assays 

The sequences of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted 
to and can hybridize with a particular location on an individual 
human chromosome. Moreover, there is a current need for 
identifying particular sites on the chromosome. Few chromosome 
marking reagents based on actual sequence data (repeat 
polymorphisms) are presently available for marking chromosomal 
location. The mapping of DNAs to chromosomes according to the 
present invention is an important first step in correlating those 
sequences with genes associated with disease. 

in certain preferred embodiments in this regard, the cDNA 
herein disclosed is used to clone genomic DNA of a hBSP I. II and 
III gene. This can be accomplished using a variety of well known 
techniques and libraries. which generally are available 
commercially. The genomic DNA the is used for in situ chromosome 
mapping using well known techniques for this purpose. Typically, 
in accordance with routine procedures for chromosome mapping, some 
trial and error may be necessary to identify a genomic probe that 
gives a good in situ hybridization signal. 

In some cases, in addition, sequences can be mapped to 
chromosomes by preparing PCR primers (preferably 15-25 bp) from the 
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cDNA. Computer analysis of the 3' untranslated region of the gene 
is used to rapidly select primers that do not span more than one 
exon in the genomic DNA, thus complicating the amplification 
process. These primers are then used for PCR screening of somatic 
cell hybrids containing individual human chromosomes. Only those 
hybrids containing the human gene corresponding to the primer will 
yield an amplified fragment. 

PCR mapping of somatic cell hybrids is a rapid procedure for 
assigning a particular DNA to a particular chromosome. Using the 
present invention with the same oligonucleotide primers, 
sublocalization can be achieved with panels of fragments from 
specific chromosomes or pools of large genomic clones in an 
analogous manner. Other mapping strategies that can similarly be 
used to map to its chromosome include in situ hybridization, 
prescreening with labeled flow-sorted chromosomes and preselection 
by hybridization to construct chromosome specif ic-cDKA libraries. 

Pluorescence in situ hybridization ("PISH") of a cDNA clone 
to a metaphase chromosomal spread can be used to provide a precise 
chromosomal location in one step. This technique can be used with 
cDNA as short as 50 or 60. Por a review of this technique, see 
Verma et al., HUMAN CHROMOSOMES: A MANUAL OP BASIC TECHNIQUES , 
Pergaraon Press, New York (1988). 

Once a sequence has been mapped to a precise chromosomal 
location, the physical position of the sequence on the chromosome 
can be correlated with genetic map data. Such data are found, for 
example, in V. McKusick, MENDSLIAN INHERITANCE IN MAN, available 
on line through Johns Hopkins University, Welch Medical Library. 
The relationship between genes and diseases that have been mapped 
to the same chromosomal region are then identified through linkage 
analysis (coinheritance of physically adjacent genes) . 

Next, it is necessary to determine the differences in the cDNA 
or genomic sequence between affected and unaffected individuals. 
If a mutation is observed in some or all of the affected 
individuals but not in any normal individuals, then the mutation 
is likely to be the causative agent of the disease. 

With current resolution of physical mapping and genetic 
mapping techniques, a cDNA precisely localized to a chromosomal 
region associated with the disease could be one of between SO and 
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500 potential causative genes. (This assumes 1 megabase mapping 
resolution and one gene per 20 kb) . 

Polypeptide assays 

The present invention also relates to a diagnostic assays such 
as quantitative and diagnostic assays Cor detecting levels of hBSF 
I, II and III protein in cells and tissues, and biological fluids 
such, for example, as blood and urine, including determination of : 
normal and abnormal levels. Thus, for instance, a diagnostic assay 
in accordance with the invention Cor detecting over-expression or 
under-expression oC hBSF I. II and III protein compared to normal 
control tissue samples may be used to detect the presence oC 
neoplasia. Cor example. Assay techniques that can be used to 
determine levels of a protein, such as an hBSF I. II and III 
protein of the present invention, in a sample derived from a host 
are well-known to those of skill in the art. Such assay methods 
include radioimmunoassays, competitive-binding assays, western Blot 
analysis and BLISA assays. Among these BLISAe frequently are 
preferred. An BLISA assay initially comprises preparing an 
antibody specific to hBSF I, II or III. preferably a monoclonal 
antibody. In addition a reporter antibody generally is prepared 
which binds to the monoclonal antibody. The reporter antibody is 
attached a detectable reagent such as radioactive, fluorescent or 
enzymatic reagent, for example horseradish peroxidase enzyme. 

To carry out an BLISA a sample is removed from a host and 
incubated on a solid support, e.g. a polystyrene dish, that binds 
the proteins in the sample. Any free protein binding sites on the 
dish are then covered by incubating with a non-specific protein 
such as bovine serum albumin. Next, the monoclonal antibody is 
incubated in the dish during which time the monoclonal antibodies 
attach to any hBSF I. II or III proteins attached to the 
polystyrene dish. Unbound monoclonal antibody is washed out with 
buffer. The reporter antibody linked to horseradish peroxidase is 
placed in the dish resulting in binding of the reporter antibody 
co any monoclonal antibody bound to hBSF I. II or III. Unattached 
reporter antibody is then washed out. Reagents for peroxidase 
activity, including a colorimetric substrate are then added to the 
dish Immobilized peroxidase, linked to hBSF I. II or III through 
the primary and secondary antibodies, produces a colored reaction 
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product. The amount of color developed in a given time period 
indicates the amount of hESF I, II or III protein present in the 
sample. Quantitative results typically are obtained by reference 
to a standard curve. 

A competition assay may be employed wherein antibodies 
specific to hBSF I, II or III attached to a solid support and 
labeled hBSF I, II or III and a sample derived from the host are 
passed over the solid support and the amount of label detected 
attached to the solid support can be correlated to a quantity of 
hBSF I, II or III in the sample. 

Antibodies 

The polypeptides, their fragments or other derivatives, or 
analogs thereof, or cells expressing them can be used as an 
immunogen to produce antibodies thereto. These antibodies can be, 
for example, polyclonal or monoclonal antibodies. The present 
invention also includes chimeric, single chain, and humanized 
antibodies, as well as Fab fragments, or the product of an Fab 
expression library. Various procedures known in the art may be 
used for the production of such antibodies and fragments. 

Antibodies generated against the polypeptides corresponding 
to a sequence of the present invention can be obtained by direct 
injection cf the polypeptides into an animal or by administering 
the polypeptides to an animal, preferably a nonhuman. The antibody 
so obtained will then bind the polypeptides itself. In this 
manner, even a sequence encoding only a fragment of the 
polypeptides can be used to generate antibodies binding the whole 
native polypeptides. Such antibodies can then be used to isolate 
the polypeptide from tissue expressing that polypeptide. 

For preparation of monoclonal antibodies, any technique which 
provides antibodies produced by continuous cell line cultures can 
be used. Examples include the hybridoma technique (Kohler, G. and 
Milstein. C, Nature 256: 495-497 (1975), the trioma technique, the 
human B -cell hybridoma technique (Koxbor et al.. Immunology Today 
4: 72 (1983) and the BBV-hybridoma technique to produce human 
monoclonal antibodies (Cole et al., pg. 77-96 in MONOCLONAL 
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc. (1985). 

Techniques described for the production of single chain 
antibodies (U.S. Patent No. 4,946,778) can be adapted to produce 
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single chain antibodies to immunogenic polypeptide products of this 
invention. Also, transgenic mice, or other organisms such as other 
mammals. «"*y be used to express humanized antibodies to immunogenic 
polypeptide products of this invention. 

The above-described antibodies may be employed to isolate or 
to identify clones expressing the polypeptide or purify the 
polypeptide of the present invention by attachment of the antibody 
to a solid support for isolation and/or purification by affinity 
chromotography. 

Thus, among others, the polynucleotides and polypeptides of 
the present invention may be employed to prevent and/or treat 
inflammation, asthma, rhinitis, cystic fibrosis, airway disease, 
prevent and/or treat neoplasia, atopy, inhibit phospholipase A,, 
bind polychlorated biphenyls. reduce foreign protein antigenicity, 
inhibit monocyte and neutrophil chemotaxie and phagocytosis, 
inhibit platelet aggregation, regulate elcosanoid levels in the 
human uterus, control the growth of endometrial cells. 

hBSF I, II and III binding molecules and assays 
This invention also provides a method for identification of 
molecules, such as receptor molecules, that bind hBSP I. II and 
III Genes encoding proteins that bind hBSP I. II and III. such 
as receptor proteins, can be identified by numerous methods known 
to those of skill in the art. for example, ligand panning and PACS 
sorting. Such methods are described in many laboratory manuals 
such as. for instance. Coligan et al.. Current Protocols in 
immunology 1 12) = Chapters (1991). 

Por instance, expression cloning may be employed for this 
purpose. To this end polyadenylated RNA is prepared from a cell 
responsive to hBSP I. II and III. a cDKA library is created from 
this Mtt, the library is divided into pools and the pools are 
transfected individually into cells that are not responsive to hBSP 
I II and III. The transfected cells then are exposed to labeled 
hBSP I. II and III. (hBSP I. II and III can be labeled by » 
variety of well-known techniques including standard methods of 
radio-iodination or inclusion of a recognition site for a site- 
specific protein kinase.) Following exposure, the cells are fixed 
and binding of cytostatin is determined. These procedures 
conveniently are carried out on glass elides. 
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Pools are identified of cDNA chat produced hBSP I, II and III- 
binding cells. Sub-pools are prepared from these positives, 
transfected into host cells and screened as described above. Using 
an iterative sub-pooling and re-screening process, one or more 
single clones that encode the putative binding molecule, such as 
a receptor molecule, can be isolated. 

Alternatively a labeled ligand can be photoaf finity linked to 
a cell extract, such as a membrane or a membrane extract, prepared 
from cells that express a molecule that it binds, such as a 
receptor molecule. Cross-linked material is resolved by 
polyacrylamide gel electrophoresis ("PAGE") and exposed to X-ray 
film. The labeled complex containing the ligand- receptor can be 
excised, resolved into peptide fragments, and subjected to protein 
micro sequencing . The amino acid sequence obtained from 
microsequencing can be used to design unique or degenerate 
oligonucleotide probes to screen cDKA libraries to identify genes 
encoding the putative receptor molecule. 

Polypeptides of the invention also can be used to assess hBSP 
I, II and III binding capacity of hBSP I, II and III binding 
molecules, such as receptor molecules, in cells or in cell-free 
preparations . 

Agonicts and antagonists - assays and molecules 
The invention also provides a method of screening compounds 
to identify those which enhance or block the action of hBSP I, II 
and III on cells, such as its interaction with hBSP I, II and III- 
binding molecules such as receptor molecules. An agonist is a 
compound which increases the natural biological functions of hBSP 
I, II and III or which functions in a manner similar to hBSP I, II 
and III, while antagonists decrease or eliminate such functions. 

Por example, a cellular compartment, such as a membrane or a 
preparation thereof, such as a membrane -preparation, may be 
prepared from a cell that expresses a molecule that binds hBSP I, 
II and III, such as a molecule of a signaling or regulatory pathway 
modulated by hBSP I, II and III. The preparation is incubated with 
labeled hBSP I, II and III in the absence or the presence of a 
candidate molecule which may be a hBSP I, II and III agonist or 
antagonist. The ability of the candidate molecule to bind the 
binding molecule is reflected in decreased binding of the labeled 
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ligand. Molecules which bind gratuitously, i.e., without inducing 
the effects of hBSP I, II and III on binding the hBSP I, II and III 
binding molecule, are most likely to be good antagonists. 
Molecules that bind well and elicit effects that are the same as 
or closely related to hBSP I, II and III are agonists. 

hBSP I, II and Ill-like effects of potential agonists and 
antagonists may by measured, for instance, by determining activity 
of a second messenger system following interaction of the candidate 
molecule with a cell or appropriate cell preparation, and comparing 
the effect with that of hBSP I, II and III or molecules that elicit 
the same effects as hBSP I, II and III. Second messenger systems 
that may be useful in this regard include but are not limited to 
AMP guanylate cyclase, ion channel or phosphoinositide hydrolysis 
second messenger systems. 

Another example of an assay for hBSP I, II and III antagonists 
is a competitive assay that combines hBSP I, II and III and a 
potential antagonist with membrane -bound hBSP I, II and III 
receptor molecules or recombinant hBSP I, II and III receptor 
molecules under appropriate conditions for a competitive inhibition 
assay. hBSP I, II and III can be labeled, such as by 
radioactivity, such that the number of hBSP I. II and III molecules 
bound to a receptor molecule can be determined accurately to assess 
the effectiveness of the potential antagonist. 

Potential antagonists include small organic molecules, 
peptides, polypeptides and antibodies that bind to a polypeptide 
of the invention and thereby inhibit or extinguish its activity. 
Potential antagonists also may be small organic molecules, a 
peptide, a polypeptide such as a closely related protein or 
antibody that binds the same sites on a binding molecule, such as 
a receptor molecule, without inducing hBSP I, II and Ill-induced 
activities, thereby preventing the action of hBSP I, II and III by 
excluding hBSP I, II and III from binding. 

Potential antagonists include a small molecule which binds to 
and occupies the binding site of the polypeptide thereby preventing 
binding to cellular binding molecules, such as receptor molecules, 
such that normal biological activity is prevented. Bxamplee of 
small molecules include but are not limited to small organic 
molecules, peptides or peptide-like molecules. 
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Other potential antagonists include ant is ens e molecules. 
Ant is ens e technology can be used to control gene expression through 
ant is ens e DNA or RNA or through triple-helix formation. Antisense 
techniques are discussed, for example, in - Okano, J. Neurochem. 
56: 560 (1991); OLIGODBOXYNUCLBOTIDSS AS ANTISBNSE INHIBITORS OP 
GENB EXPRESSION, CRC Press, Boca Raton, PL (1988) . Triple helix 
formation is discussed in, for instance Lee et al., Nucleic Acids 
Research 6: 3073 (1979); Cooney et al.. Science 241: 456 (1988); 
and Dervan et al.. Science 251: 1360 (1991) . The methods are based 
on binding of a polynucleotide to a complementary DNA or RNA. Por 
example, the 5 # coding portion of a polynucleotide that encodes the 
mature polypeptide of the present invention may be used to design 
an antisense RNA oligonucleotide of from about 10 to 40 base pairs 
in length. A DNA oligonucleotide is designed to be complementary 
to a region of the gene involved in transcription thereby 
preventing transcription and the production of hBSP I, II and III. 
The antisense RNA oligonucleotide hybridizes to the mRNA in vivo 
and blocks translation of the mRNA molecule into hBSP I, II and III 
polypeptide. The oligonucleotides described above can also be 
delivered to cells such that the antisense RNA or DNA may be 
expressed in vivo to inhibit production of hBSP I, II and III. 

The antagonists may be employed in a composition with a 
phannaceutically acceptable carrier, e.g., as hereinafter 
described. 

The antagonists may be employed for instance to treat an 
inherited susceptibility to asthma. 

Compositions 

The invention also relates to compositions comprising the 
polynucleotide or the polypeptides discussed above or the agonists 
or antagonists. Thus, the polypeptides of the present invention 
may be employed in combination vith a non-sterile or sterile 
carrier or carriers for use with cells, tissues or organisms, such 
as a pharmaceutical carrier suitable for administration to a 
subject. Such compositions comprise, for instance, a media 
additive or a therapeutically effective amount of a polypeptide of 
the invention and a pharmaceutically acceptable carrier or 
excipient. Such carriers may include, but are not limited to, 
saline, buffered saline, dextrose, water, glycerol, ethanol and 
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combinations thereof. The formulation should Buit the mode of 
administration . 



Kits 

The invention further relates to pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the 
ingredients of the aforementioned compositions of the invention. 
Associated with such container (s) can be a notice in the form 
prescribed by a governmental agency regulating the manufacture, use 
or sale of pharmaceuticals or biological products, reflecting 
approval by the agency of the manufacture, use or sale of the 
product for human administration. 

Administration 

Polypeptides and other compounds of the present invention may 
be employed alone or in conjunction with other compounds, such as 
therapeutic compounds. 

The pharmaceutical compositions may be administered in any 
effective, convenient manner including, for instance, 
administration by topical, oral, anal, vaginal, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or 
intradermal routes among others. 

The pharmaceutical compositions generally are administered in 
an amount effective for treatment or prophylaxis of a specific 
indication or indications. In general, the compositions are 
administered in an amount of at least about 10 m9/*9 body weight. 
In most cases they will be administered in an amount not in excess 
of about 8 mg/kg body weight per day. Preferably, in most cases, 
dose is from about 10 pg/kg to about 1 mg/kg body weight, daily. 
It will be appreciated that optimum dosage will be determined by 
standard methods for each treatment modality and indication, taking 
into account the indication, its severity, route of administration, 
complicating conditions and the like. 

Gene therapy 

The hBSP I, II and III polynucleotides, polypeptides, agonists 
and antagonists that are polypeptides may be employed in accordance 
with the present invention by expression of such polypeptides in 
vivo, in treatment modalities often referred to as "gene therapy." 
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Thus, for example, cells from a patient nay be engineered with 
a polynucleotide, such as a DNA or RKA, encoding a polypeptide ex 
vivo, and the engineered cells then can be provided to a patient 
to be treated with the polypeptide. Por example, cells may be 
engineered ex vivo by the use of a retroviral plasmid vector 
containing RNA encoding a polypeptide of the present invention. 
Such methods are well-known in the art and their use in the present 
invention will be apparent from the teachings herein. 

Similarly! cells may be engineered in vivo for expression of 
a polypeptide in vivo by procedures known in the art. Por example, 
a polynucleotide of the invention may be engineered for expression 
in a replication defective retroviral vector, as discussed above. 
The retroviral expression construct then may be isolated and 
introduced into a packaging cell is transduced with a retroviral 
plasmid vector containing RNA. encoding a polypeptide of the present 
invention such that the packaging cell now produces infectious 
viral particles containing the gene of interest. These producer 
cells may be administered to a patient for engineering cells in 
vivo and expression of the polypeptide in vivo. These and other 
methods for administering a polypeptide of the present invention 
by such method should be apparent to those skilled in the art from 
the teachings of the present invention. 

Retroviruses from which the retroviral plasmid vectors herein 
above mentioned may be derived include, but are not limited to, 
Moloney Murine Leukemia Virus, spleen necroBis virus, retroviruses 
such as Rous Sarcoma Virus, Harvey Sarcoma Virus, avian leukosis 
virus, gibbon ape leukemia virus, human immunodeficiency virus, 
adenovirus, Myeloproliferative Sarcoma Virus, and mammary tumor 
virus. In one embodiment, the retroviral plasmid vector is derived 
from Moloney Murine Leukemia Virus. 

Such vectors well include one or more promoters for expressing 
the polypeptide. Suitable promoters which may be employed include, 
but are not limited to, the retroviral LTR; the SV40 promoter; and 
the human cytomegalovirus (CMV) promoter described in Miller et 
al., Biotechniques 7; 980-990 (19B9) , or any other promoter (e.g., 
cellular promoters such as eukaryotic cellular promoters including, 
but not limited to, the hietone, RNA polymerase III, and S-actin 
promoters) . Other viral promoters which may be employed include, 
but are not limited to, adenovirus promoters, thymidine kinase (TK) 
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promoters, and B19 parvovirus promoters. The selection of a 
suitable promoter will be apparent to those skilled in the art from 
the teachings contained herein. 

The nucleic acid sequence encoding the polypeptide of the 
present invention will be placed under the control of a suitable 
promoter. Suitable promoters which may be employed include, but 
are not limited to, adenoviral promoters, such as the adenoviral 
major late promoter; or heterologous promoters, such as the 
cytomegalovirus (CMV) promoter; the respiratory syncytial virus 
(RSV) promoter; inducible promoters, such as the MfT promoter, the 
metallothionein promoter; heat shock promoters; the albumin 
promoter; the ApoAI promoter; human globin promoters; viral 
thymidine kinase promoters, such as the Herpes Simplex thymidine 
kinase promoter; retroviral LTRs (including the modified retroviral 
LTRs herein above described) ; the 6- act in promoter; and human 
growth hormone promoters. The promoter also may be the native 
promoter which controls the gene encoding the polypeptide. 

The retroviral plasmid vector is employed to transduce 
packaging cell lines to form producer cell lines. Examples of 
packaging cells which may be trans fee ted Include, but are not 
limited to, the PB501, PA317, Y-2, Y-AM, PA12, T19-14X, VT-19-17- 
H2, YCRB, YCRIP. GP+B-86, GP+envAml2, and DAM cell lines as 
described in Miller, A., Human Gene Therapy 1: 5-14 (1990). The 
vector may be transduced into the packaging cells through any means 
known in the art. Such means include, but are not limited to, 
electroporation, the use of liposomes, and CaP04 precipitation. 
In one alternative, the retroviral plasmid vector may be 
encapsulated into a liposome, or coupled to a lipid, and then 
administered to a host. 

The producer cell line will generate infectious retroviral 
vector particles, which include the nucleic acid sequence (s) 
encoding the polypeptides. Such retroviral vector particles then 
may be employed to transduce eukaryotic cells, either in vitro 
in vivo. The transduced eukaryotic cells will express the nucleic 
acid sequence (s) encoding the polypeptide. Bukaryotic cells which 
may be transduced include, but are not limited to, embryonic stem 
cells, embryonic carcinoma cells, as well as hematopoietic Btem 
cells, hepatocytes, fibroblasts, myoblasts, keratinocytes, 
endothelial cells, and bronchial epithelial cells. 
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EXAMPLES 

The present invention is further described by the following 
examples. The examples are provided solely to illustrate the 
invention by reference to specific embodiments. These 
exem plification's, while illustrating certain specific aspects of 
the invention, do not portray the limitations or circumscribe the 
scope of the disclosed invention. 

Certain terms used herein are explained in the foregoing 
glossary. 

All examples were carried out using standard techniques, which 
are well known and routine to those of skill in the art, except 
where otherwise described in detail. Routine molecular biology 
techniques of the following examples can be carried out as 
described in standard laboratory manuals, such as Sambrook et al., 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Bd. ; Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (1989) , herein referred 
to as "Sambrook." 

All parts or amounts set out in the following examples are by 
weight, unless otherwise specified. 

Unless otherwise stated size separation of fragments in the 
examples below was carried out using standard techniques of agarose 
and polyacrylamide gel electrophoresis ("PAGE") in Sambrook and 
numerous other references such as, for instance, by Goeddel et al.. 
Nucleic Acids Res. 6: 4057 (1980). 

unless described otherwise, ligations were accomplished using 
standard buffers, incubation temperatures and times, approximately 
equimolar amounts of the DNA fragments to be ligated and 
approximately 10 units of T4 DNA ligase ("ligase") per 0.5 jig of 
DNA. 

Example 1 Expression and purification of human hBSP I, II and III 
using bacteria 

The DNA sequence encoding human hBSP I, II or III in the 
deposited polynucleotide was amplified using PCR oligonucleotide 
primers specific to the amino acid carboxyl terminal sequence of 
the human hBSP I. II or III protein and to vector sequences 3' to 
the gene. Additional nucleotides containing restriction sites to 
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facilitate cloning were added to the 5' and 3' sequences 

respectively. 

The 5' oligonucleotide primer had the sequence 

for hBSF I: 5' PG Q»CATGC ITGTCTGCCGAGCTO 3' (SBQ ID NO: 7) 
containing the underlined Sph I restriction site, which encodes a 
start AUG, followed by 15 nucleotides of the human hBSF I coding 
sequence net out in Figure 1 (SBQ ID NO:D , beginning with the 
first base of the codon for amino acid 22 (leucine). 

for hBSF II: 5' (^^^(TlTCrGaXAGCrC 3' (SBQ ID N0:8) 
containing the underlined Ncol restriction site, which encodes a 
start ATG, followed by 16 nucleotides of the human hBSF II coding 
sequence set out in Figure 2 (SBQ ID NO: 3), beginning with the 
first base of the codon for amino acid 22. 

for hBSF III: 5' CGC fififtJCSC ACT GCT ATG CAG ATT 3' (SBQ ID 
MO- 9) containing the underlined SphI restriction site, which 
encodes a start ATG. followed by 16 nucleotides of the human hBSF 
III coding sequence set out in Figure 3 (SBQ ID N0:5) . 

The 3' primer has the sequence 

for hBSF I 5' CGC^mCATnTTACATGTCA 3' (SBQ ID NO: 10) 
containing the underlined Hind III restriction site followed by 15 
nucleotides complementary to 15 nucleotides of the hBSF I non- 
coding sequence set out in Figure 1 (SBQ ID N0:1>, including the 
stop codon. 

for hBSF II 5' CGC6aSS3IAGTITrrACATOTCA 3' (SBQ -ID HO: 11) 
containing the underlined Hind III restriction site followed by 15 
nucleotides complementary to the last 15 nucleotides of the hBSF 
II non-coding sequence sec out in Figure 2 (SBQ ID KO:3) . including 
the stop codon. 

for hBSF III 5' CGC AAG CTT ACS CCT TGG GTA AAG TTA (SBQ 

ID H0:12> containing the underlined HlndlH restriction site 
followed by 18 nucleotides complementary to hBSF III non-coding 
sequence set out in Figure 3 (SBQ ID HO = S). including the stop 
codon. 

The restrictions sites were convenient to restriction enzyme 
sites in the bacterial expression vectors pQB-60 (hBSF I and II) > 
(Qiagen, Inc.) which were used for bacterial expression in these 
examples. (Qiagen. Inc. Chatsworth. CA) . pQB-60 encodes 
ampicillin antibiotic resistance CAmpr") and contains a bacterial 
origin of replication Cori") . an IPTG inducible promoter, a 
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ribosome binding site ("RBS") , a 6 -His tag and restriction enzyme 
sites . 

The amplif ied human hBSF I ( II and III UNA and the vector pQB- 
60 both were digested with Sph I and Bind III (hBSF I) , Nco I and 
Hindlll (hBSF II) and SphI and Hindi 1 1 (hBSF III) and the digested 
DMAs then were ligated together. Insertion of the hBSF I DNA. into 
the restricted vector placed the respective coding regions 
downstream of and operably linked to the vector's I PTG- inducible 
promoter and in- frame with an initiating AUG appropriately 
positioned for translation of hBSF I, II and III. 

The ligation mixture was transformed into competent E. coli 
cells using standard procedures. Such procedures are described in 
Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed.; 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 
(1989) . B. coli strain M15/rep4, containing multiple copies of the 
plasmid pRBP4 f which expresses lac repressor and confers kanamycin 
resistance ("Kanr*), was used in carrying out the illustrative 
example described here. This strain, which is only one of many 
that are suitable for expressing hBSF I, II and III is available 
commercially from Qiagen. 

Trans formantfl were identified by their ability to grow on LB 
plates in the presence of ampicillin. Plasmid SNA was isolated 
from resistant colonies and the identity of the cloned DNA was 
confirmed by restriction analysis. 

Clones containing the desired constructs were grown overnight 
("O/N") in liquid culture in LB media supplemented with both 
ampicillin (100 ug/ml) and kanamycin (25 ug/ml) . 

The O/N culture was used to inoculate a large culture, at a 
dilution of approximately 1:100 to 1:250. The cells were grown to 
an optical density at 600nm ("OD600") of between 0.4 and 0.6. 
Isopropyl-B-D-thiogalactopyranoside ("IPTG") was then added to a 
final concentration of l mM to induce transcription from lac 
repressor sensitive promoters, by inactivating the lad repressor. 
Cells subsequently were incubated further for 3 to 4 hours. Cells 
then were harvested by centrifugation and disrupted, by standard 
methods. Inclusion bodies were purified from the disrupted cells 
using routine collection techniques, and protein was solubilized 
from the inclusion bodies into 8M urea. The 8M urea solution 
containing the solubilized protein was passed over a PD-10 column 
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in 2X phosphate buffered saline ("PBS") , thereby removing the urea, 
exchanging the buffer and refolding the protein. The protein was 
purified by a further step of chromatography to remove endotoxin. 
Then, it was eterile filtered. The sterile filtered protein 
preparation was stored in 2X PBS at a concentration of 95 

micrograms per mL. 

Analysis of the preparation by standard methods of 
polyacrylamide gel electrophoresis revealed that the preparation 
contained about 95% monomer hBSF I, II and III having the expected 
molecular weight. 

Example 2 Cloning and expression of human hBSF I, II and III in a 
baculovirus expression system 

The cDNA sequence encoding the full length human hBSF I, II 
and III protein, in the deposited clone is amplified using PCR 
oligonucleotide primers corresponding to the 5' and 3' sequences 
of the gene: 

for hBSF I the 5' primer has the sequence 5' CCCGOATCC 
GCCATCftlSAGGCTGTCAGTGTGTCT 3' (SBQ ID NO: 13) containing the BamHI 
restriction enzyme site (bold) followed by a kozak sequence (GCC 
ATC) and 20 bases of the sequence of hBSF I of Figure 1 (SBQ 10 
NO:l) ; 

for hBSF II the 5' primer has the sequence 5' CGC QOA TCC GCC 
ATC ATG AAG CTG TCG GTG 3' (SBQ ID N0:14) containing the BamHI 
restriction enzyme site (bold) followed by 15 bases of the sequence 
of hBSF II Of Figure 2 (SBQ ID NO; 3); 

for hBSF III the 5' primer has the sequence 5' CGC SSnJECC GCC 
ATC ATG AAG CTG CTG ATG GTC 3* (SBQ ID NO: 15) containing the BamHI 
restriction enzyme site (bold) followed by 15 bases of the sequence 
of hBSF III of Figure 3 (SBQ ID NO: 5) . 

inserted into an expression vector, as described below, the 
5* end of the amplified fragment encoding human hBSF I, II or III 
provides an efficient signal peptide. An efficient signal for 
initiation of translation in eukaryotic cells, as described by 
Kozak, M. , J. Mol. Biol. 196: 947-950 (1987) is appropriately 
located in the vector portion of the construct. 

For hBSF I the 3* primer has the sequence 5' CCCSZE&CC 
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TTTTTTrrTTTTTITTTT 3' (SBQ ID H0:16) containing the underlined 
Asp718 restriction site followed by 16 nucleotides complementary 
to the poly A tail; 

for hBSP II the 3' primer has the sequence 5' CGC 
S£X&££AGGCCTTGG(7IAAAGTTA 3' (SBQ ID NO: 17) containing the 
underlined Asp7i8 restriction followed toy nucleotides complementary 
to 15 nucleotides of the hBSP II non- coding sequence set out in 
Figure 2 (SBQ ID NO: 3) , including the stop codon; 

for hBSP III the 3' primer has the sequence 5' CGC GGT ACT ACG 
CCT TS6 GTA AAG TTA 3' (SBQ ID NO: 18) containing the underlined 
Asp7l8 restriction followed by nucleotides complementary to 18 
nucleotides of the hBSP III non-coding sequence set out in Figure 
3 (SBQ ID NO: 5) , including the stop codon. 

The amplified fragments are isolated from a 1% agarose gel 
using a commercially available Kit ("Geneclean, " BIO 101 Inc., La 
Jolla, Ca.). The fragments then are digested with the respective 
restriction enzymes and again are purified on a 1% agarose gel. 
This fragments are designated herein F2. 

The vector pRGl is used to express the hBSP I, II or III 
protein in the baculovirus expression system, using standard 
methods, such as those described in Summers et al, A MANUAL OF 
METHODS FOR BACULOVIRUS VECTORS AND INSECT CELL CULTURE PROCBDURBS, 
Texas Agricultural Experimental Station Bulletin No. 1555 (1987) . 
This expression vector contains the strong polyhedrin promoter of 
the Autographa califomica nuclear polyhedrosis virus (AcMNPV) 
followed by convenient restriction sites. The polyadenylation site 
of the simian virus 40 ("SV40") is used for efficient 
polyadenylation. For an easy selection of recombinant virus the 
beta-galactosidase gene from B.coli is inserted in the same 
orientation as the polyhedrin promoter and is followed by the 
polyadenylation signal of the polyhedrin gene. The polyhedrin 
sequences are flanked at both sides by viral sequences for cell- 
mediated homologous recombination with wild- type viral DNA to 
generate viable virus that express the cloned polynucleotide. 

Many other baculovirus vectors could be used in place of pA2, 
such as pAc373, pVL941 and pAcIMl provided, as those of skill 
readily will appreciate, that construction provides appropriately 
located signals for transcription, translation, trafficking and the 
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like. Such vectors are described in Luckow et al., Virology 170: 

31-39, among others. 

The plasmid is digested with the respective restriction 
enzymes and then is dephosphorylated using calf intestinal 
phosphatase, using routine procedures known in the art. The UNA 
is then isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.>. This 
vector CHX is designated herein *V2*. - 1 < 

Fragments P2 and the dephosphorylated plasmid V2 are ligated 
together with T4 DNA ligase. B.coli HB101 cells are transformed 
with ligation mix and spread on culture plates. Bacteria are 
identified that contain the plasmid with the human hBSP I , II or 
III gene by digesting DNA from individual colonies using the 
respective restriction enzymes and then analyzing the digestion 
product by gel electrophoresis. The sequence of the cloned 
fragment is confirmed by DNA. sequencing. This plasmid is 
designated herein pBachBSF I, II or III. 

5 ,ig of the plasmid pBachBSF I, II or III is co-transf ected 
with 1.0 M9 ot a commercially available linearized baculovirus DNA 
CBaculoGold™ baculovims DNA" , Pharmingen, San Diego, CA.), using 
the lipofection method described by Feigner et al., Proc. Natl. 
Acad. Sci. USA 84: 7413-7417 (1987) . lf*g Of BaculoGold" virus DMA 
and 5 us cf the plasmid pBachBSP I, II or III are mixed in a 
sterile well of a microtiter plate containing 50 fil of .serum free 
Grace's medium (Life Technologies Inc., Gaithersburg, MD) . 
Afterwards 10 fil Lipofectin plus 90 ul Grace's medium are added, 
mixed and incubated for 15 minutes at room temperature. Then the 
transfection mixture is added drop-wise to Sf 9 insect cells (ATCC 
CRL 1711) seeded in a 35 mm tissue culture plate with 1 ml Grace's 
medium without serum. The plate is rocked back and forth to mix 
the newly added solution. The plate is then incubated for 5 hours 
at 27 *C. After 5 hours the transfection solution is removed from 
the plate and l ml of Grace's insect medium supplemented with 10% 
fetal calf serum is added. The plate is put back into an incubator 
and cultivation is continued at 27 *C for four days. 

After four days the supernatant is collected and a plaque 
assay is performed, as described by Summers and Smith, cited above. 
An agarose gel with -Blue Gal- (Life Technologies Inc., 
Gaithersburg) is used to allow easy identification and isolation 



-54- 



WO 97/34997 



PCT/US96/W857 



of gal -expressing clones, which produce blue -stained plaques. (A 
detailed description of a "plaque assay" of this type can also be 
found in the user's guide for insect cell culture and 
baculovirology distributed by Life Technologies Inc. , Gaithersburg, 
page 9-10) . 

Pour days after serial dilution, the virus is added to the 
cells. After appropriate incubation, blue stained plaques are 
picked with the tip of an Bppendorf pipette. The agar containing 
the recombinant viruses is then resuspended in an Bppendorf tube 
containing 200 pi of Grace's medium. The agar is removed by a 
brief centrif ugation and the supernatant containing the recombinant 
baculovirus is used to infect Sf9 cells seeded in 35 mm dishes. 
Pour days later the supernatants of these culture dishes are 
harvested and then they are stored at 4'C. A clone containing 
properly inserted hESF I, II or III is identified by DNA analysis 
including restriction mapping and sequencing. This is designated 
herein as V-hBSP I, II or III. 

Sf9 cells are grown in Grace's medium supplemented with 10% 
heat -inactivated PBS. The cells are infected with the recombinant 
baculovirus V-hBSP I, II or III at a multiplicity of infection 
("MOI") of about 2. Six hours later the medium is removed and is 
replaced with SP900 II medium minus methionine and cysteine 
(available from Life Technologies Inc., Gaithersburg). 42 hours 
later, 5 pCi of 3 5S -methionine and 5 pCi 35S cysteine (available 
from Amersham) are added. The cells are further incubated for 16 
hours and then they are harvested by centrif ugation, lysed and the 
labeled proteins are visualized by SDS-PAGB and autoradiography. 

Example 3 Expression of hBSP I, I I and III in COS cells 

The expression plasmid, hBSP I, II and III HA, is made by 
cloning a cDNA encoding hBSP I , II and III into the expression 
vector pcCKAI/Amp {which can be obtained from Invitrogen, Inc.). 

The expression vector pcDNAI/amp contains: (1) an B.coli 
origin of replication effective for propagation in E. coli and 
other prokaryotic cell; (2) an ampicillin resistance gene for 
selection of plasmid- containing prokaryotic cells; (3) an SV40 
origin of replication for propagation in eukaryotic cells; (4) a 
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OW promoter, a polylinker, an SV40 intron, and a polyadenylation 
signal arranged so that a cDNA conveniently can be placed under 
expression control of the CMV promoter and operably linked to the 
SV40 intron and the polyadenylation signal by means of restriction 
sites in the poly linker. 

A DNA fragment encoding the entire hBSP I, II and III 
precursor and a HA tag fused in frame to its 3' end is cloned into 
the polylinlcer region of the vector so that recombinant protein 
expression is directed by the CMV promoter. The HA tag corresponds 
to an epitope derived from the influenza hemagglutinin protein 
described by Wilson et al.. Cell 37: 767 (1984) . The fusion of the 
HA tag to the target protein allows easy detection of the 
recombinant protein with an antibody that recognizes the HA 
epitope . 

The plasmid construction strategy is as follows: 

The hBSP I, II and III cDNA of the deposit clone is amplified 
uBing primers that contained convenient restriction sites, much as 
described above regarding the construction of expression vectors 
for expression of hBSP I. II and III in B. coli and S. fugiperda. 

To facilitate detection, purification and characterisation of 
the expressed hBSP I, II and III, one of the primers contains a 
heamaglutinin tag ("HA tag") as described above. 

Suitable primers include that following, which are used in 
this example: 

The 5' primer, containing the underlined BamHI site, an AUG 
start codon and has the following sequence. 5' CGC GGA TCC ACC ATC 
GTC TCG CTG GCC CTT 3' (SBQ ID N0:19) (BSFI) ; 5' CGC GGA TCC ACC 
ATG AAG CTG TOG GTG TGT 3* (SBQ ID N0:20) (BSPII) ; 5' CGC GGA TCC 
ACC ATG AAG CTG CTG ATG GTC 3* (SBQ ID NO: 21) (BSFIII) . 

The 3* primer, containing the underlined Xbal site, stop 
codon, HA tag and 15 bp of 3' coding sequence (at the 3' end) has 
the following sequence: 

5' CGC X£T TCA AGC GTA GTC TGG GAC GTC GTA TGG GTA CAC ACC ACA 
TTT TTT 3 (SBQ ID N0;22> (BSFI) ; 5' CGC TCT ftgft TCA AGC GTA GTC 
TGG GAC GTC GTA TGG GTA CAC ACT ACA TTT CTT 3' (SBQ ID N0:23) 

(BSPII) ; 5' CGC TCT AGA TCA AGC GTA GTC TGG GAC GTC GTA TGG GTA ATT 
ACT CTT CAT ATT 3' (SBQ ID NO: 24) (BSFIII) . 

The PCR amplified DNA fragment and the vector, pcDNAI/Amp, are 

digested with and then ligated. The ligation mixture is 
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transformed Into B. coli strain SURB (available from Stratagene 
Cloning Systems, 11099 North Torrey Pines Road, La Jolla, CA 92037) 
the transformed culture is plated on ampicillin media plates which 
then axe incubated to allow growth of ampicillin resistant 
colonies. Plasmid DNA is isolated from resistant colonies and 
examined by restriction analysis and gel sizing for the presence 
of the hBSF I # II and III -encoding fragment. 

Por expression of recombinant hBSF I, II and III/ COS cells T* 
are transfected with an expression vector, as described above, 
using DEAB-DEXTRAN, as described, for instance, in Sambrook et al., 
MOLECULAR CLONING: A LABORATORY MANUAL, Cold Spring Laboratory 
Press, Cold Spring Harbor, New York (1989). Cells are 

incubated under conditions for expression of hBSF I, II and III by 
the vector. 

Expression of the hBSF I, II and III HA fusion protein is 
detected by radiolabelling and .imnunoprecipitation, using methods 
described in, for example Harlow et al., ANTIBODIES : A LABORATORY 
MANUAL, 2nd Bd.; Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York (1988) . To this end, two days after transfection, 
the cells are labeled by incubation in media containing 35S- 
cysteine for 8 hours. The cells and the media are collected, and 
the cells are washed and the lysed with detergent -containing RIPA 
buffer: 150 mM NaCl, 1% NP-40, 0.1% SDS, II NP-40, 0.5% DOC, 50 mM 
TRIS, pH 7.5, as described by Wilson et al. cited above. Proteins 
are precipitated from the cell lysate and from the culture media 
using an HA-specific monoclonal antibody. The precipitated 
proteins then are analyzed by SDS -PAGE gels and autoradiography. 
An expression product of the expected size is seen in the cell 
lysate, which is not seen in negative controls. 

Example 4 Tissue distribution of hBSF I, II and III expression 

Northern blot analysis is carried out to examine the levels 
of expression of hBSF I, II and III in human tissues, using methods 
described by, among others, Sambrook et al, cited above. Total 
cellular RNA samples are isolated with RNAzol™ B system (Biotecx 
Laboratories, Inc. 6023 South Loop Bast, Houston, TX 77033) . 

About lOjig of Total RNA is isolated from tissue samples. The 
RNA is size resolved by electrophoresis through a 1% agarose gel 
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under strongly denaturing conditions. RKA is blotted from the gel 
onto a nylon filter, and the filter then iB prepared for 
hybridization to a detectably labeled polynucleotide probe. 

As a probe to detect mRNA that encodes hBSP I, II and ill, the 
antisense strand of the coding region of the cDKA insert in the 
deposited clone is labeled to a high specific activity. The cDNA 
is labeled by primer extension, using the Prime-It Kit, available 
from Stratagene. The reaction is carried out using 50 ng of the * 
cDNA, following the standard reaction protocol as recommended by 
the supplier. The labeled polynucleotide is purified away from 
other labeled reaction components by column chromatography using 
a Select-G-50 column, obtained from 5-Prime - 3-Prime, Inc. of 5603 
Arapahoe Road, Boulder, CO 80303. 

The labeled probe is hybridized to the filter, at a 
concentration of 1,000,000 cpm/ml, in a small volume of 7% SDS, 0.5 
M NaP04, pH 7.4 at 6S*C, overnight. 

Thereafter the probe solution is drained and the filter iB 
washed twice at room temperature and twice at 60'C with 0.5 x SSC. 
0.1% SDS. The filter then is dried and exposed to film at -70 C 
overnight with an intensifying screen. 

Example 5 Gene therapeutic expression of human hBSF I, II and III 

Fibroblasts are obtained from a subject by skin biopsy. The 
resulting tissue is placed in tissue -culture medium and separated 
into small pieces. Small chunks of the tissue are placed on a wet 
surface of a tissue culture flask, approximately ten pieces are 
placed in each flask. The flask is turned upside down, closed 
tight and left at room temperature overnight. After 24 hours at 
room temperature, the flask is inverted - the chunks of tissue 
remain fixed to the bottom of the flask - and fresh media is added 

(e g. , Ham's F12 media, with 10% FBS, penicillin and streptomycin) . 

The tissue is then incubated at 37'C for approximately one week. 

At this time, fresh media is added and subsequently changed every 

several days. After an additional two weeks in culture, a 

monolayer of fibroblasts emerges. The monolayer is trypsinized and 

scaled into larger flasks. 

A vector for gene therapy is digested with restriction enzymes 

for cloning a fragment to be expressed. The digested vector is 

-58- 



WO 97/34997 



PCT/US96/038S7 



treated with calf intestinal phosphatase to prevent self -ligation. 
The dephosphorylated, linear vector is fractionated on an agarose 
gel and purified. 

h&SP I, II and III cDNA capable of expressing active hESF I, 
II and III, is isolated. The ends of the fragment are modified, 
if necessary, for cloning into the vector. For instance, 5" 
overhanging may be treated with DNA polymerase to create blunt 
ends. 3 ' overhanging ends may be removed using Si nuclease. 
Linkers may be ligated to blunt ends with T4 DNA ligase. 

Equal quantities of the Moloney murine leukemia virus linear 
backbone and the hBSP I, II or III fragment are mixed together and 
joined using T4 DNA ligase. The ligation mixture is used to 
transform 2. Coli and the bacteria are then plated onto agar- 
containing kanamycin. Kanamycin phenotype and restriction analysis 
confirm that the vector has the properly inserted gene. 

Packaging cells are grown in tissue culture to confluent 
density in Dulbecco's Modified Bagles Medium (OMBM) with 10% calf 
serum (CS) , penicillin and streptomycin. The vector containing the 
hESF I, II or III gene is introduced into the packaging cells by 
standard techniques. Infectious viral particles containing the hESF 
I, II or III gene are collected from the packaging cells, which now 
are called producer cells. 

Fresh media is added to the producer cells, and after an 
appropriate incubation period media is harvested from. the plates 
of confluent producer cells. The media, containing the infectious 
viral particles, is filtered through a Millipore filter to remove 
detached producer cells. The filtered media then is used to infect 
fibroblast cells. Media is removed from a sub-confluent plate of 
fibroblasts and quickly replaced with the filtered media. 
Polybrene (Aldrich) may be included in the media to facilitate 
transduction. After appropriate incubation, the media is removed 
and replaced with fresh media. If the titer of virus is high, then 
virtually all fibroblasts will be infected and no selection is 
required. If the titer is low, then it is necessary to use a 
retroviral vector that has a selectable marker, such as neo or his, 
to select out transduced cells for expansion. 

Engineered fibroblasts then may be injected into rats, either 
alone or after having been grown to confluence on microcarrier 
beads, such as cytodex 3 beads. The injected fibroblasts produce 
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hBSF X. II or III product, and the biological actions o£ the 
protein are conveyed to the hoat. 

It will be clear that the invention nay be practiced otherwise 
than as particularly described in the foregoing description and 
exsns>X68 . 

Humerous modifications and variations of the present invention 
are possible in light of the above teachings and, therefore, are 
within the scope of the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Oantx, Reiner 

(ii) TITLE 07 INVENTION: HUMAN ENDOMETRIAL SPECIFIC 
STEROID -BINDING FACTOR I, II AMD III 

(iii) NUMBER OF SEQUENCES: 27 

(It) CORRSSFOMDEHC! ADDRESS: 

(A) ADDRESSEE : CARBLLA, BYRNE, BAIN, GIL7ILLAN, CBCCHI, 

STEWART & OL5TEIN 

(B) STREET: 6 BECKER FARM ROAD 

(C) CITY: ROSBLAKD 

(D) STATE: NEW JERSEY 
(B) COUNTRY: USA 

(F) ZIP: 07068-1739 

<▼) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy dilfc 

(B) COMPUTER: IBM PC COnpatible 

(C) OPERATING SYSTEM: PC - DOS /MS -DOS 

(D) SOFTWARE: Patencln Re la a* a H.O, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Farraro, Gregory D 

(B) REGISTRATION NUMBER: 36,134 

(C) REFERENCE /DOCKET NUMBER: 325800-520 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 



(2) INFORMATION FOR SBQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 433 £*■• pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : ■ ingle 
(D> TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 43 . .312 

(ix) FEATURE: 

(A) NAME /KEY : eig^peptide 

(B) LOCATION: 43 . .105 

(ix) FEATURE: 

(A) NAME/KEY: mat_pepcide 

(B) LOCATION: 106.. 3 12 



(xi) SEQUENCE DESCRIPTION ; SEQ ID NO:l: 

TCACTCATTG TGAAAGCTGA GCTCACAGCC GAATAAGCCA CC ATG AGO CTG TCA 54 

Met Arg Leu Ser 
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-21 -20 



GTG TGT CTC CTG ATG GTC TCG CTG GCC CTT TGC TGC TAC CAG GCC CAT 102 
Val Cys I*«u L«u Met Val Ser Leu Ala Leu Cya Cys Tyr Gin Ala His 

.15 -10 * s 



30 



Cya Thr Aap Gin He Ser Phe Lya Lya Arg Leu Ser Leu Glu Lya Val 
45 50 55 

Leu val Glu He val Lya Lys Cya Gly val 
€0 « 

<2 ) INFORMATION FOR SHQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 36 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

iii) MOLECULE TYPE: DNA (genomic) 



431 



GCT CTT GTC TGC CCA OCT GTT GCT TCT GAG ATC ACA GTC TTC TTA TTC ISO 
Ala Leu Val Cya Pro Ala Val Ala Ser Glu He Thr Val Phe Leu Phe 
1 5 10 15 

TTA ACT GAC OCT GCG GTA AAC CTC CAA GTT GCC AAA CTT AAT CCA CCT 190 
Lw St Mp JOa Alt Val Aan Leu Gin Val Ala Lys Leu Aan Pro Pro 

30 25 30 - 

CCA GAA OCT CTT GCA GCC AAG TTG GAA GTG AAG CAC TGC ACC GAT CAG 246 
Pro Glu Ala Leu Ala Ala Lya Leu Glu Val Lys Hia Cya Thr Aap Gin 
35 «0 « 

ATA TCT TTT AAG AAA CGA CTC TCA TTG GAA AAA GTC CTG GTG GAA ATA 294 
lie ser Phe Lya Lys Arg Uu Ser Leu Glu Lya Val Leu Val Glu lie 
50 55 «0 

GTG AAA AAA TGT GGT GTG TGACATGTAA AAATGCTCAA CCTGGTTTCC 3*2 
Val Lya Lya Cys Gly Val 
65 

AJUU3TCTTTC AACOACACCC TGATCTTCAC TAAAAATTGT AAAGGTTTCA ACACGTTGCT 402 
TTAATAAATC ACTTOCCCTG CACATCAAAA A 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg Leu Ser Val Cya Leu Leu Met Val Ser Leu Ala Leu Cys Cya 

-21 -20 -15 * 10 

Tyr Gin Ala Hia Ala Leu Val Cya Pro Ala Val Ala Sex Glu lie Thr 

* 5 1 5 10 

Val Phe Leu Phe Leu Ser Aap Ala Ala Val Aan Leu Gin Val Ala Lya 
15 20 25 

Leu Asn Pro Pro Pro Glu Ala Leu Ala Ala Lys Leu Glu Val Lya His 
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(ix) FEATURE : 

(A) MAKE /KEY : CDS 

(B) LOCATION: 40.. 309 

(ix) FEATURE: 

(A) NAME /KEY : aigjeptide 

(B) LOCATION: 40.. 102 

(IX) FEATURE : 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 103.. 309 



(xit SEQUENCE DESCRIPTION: SBQ ID NO:3: 

TTGTTTGTCA AASCTSAGCT CACAGCAAAA CAAGCCACC ATG AAC CTQ TOS GTG 54 

Met Lys Leu Ser val 
-21 -20 

TOT CTC CTG CT0 GTC ACS CTO GCC CTC TQC TQC TAC CAG GCC AAT GCC 102 
Cys Leu Leu Leu Val Tbr Leu Ala Leu Cys Cys Tyr Qln Ala Aan Ala 
-IS -10 -5 

GAG TTC TGC CCA OCT CTT GTT TCT GAG CTG TTA GAC TTC TTC TTC ATT ISO 
Glu Phe Cys Pro Ala Leu Val Ser Glu Leu Leu Asp Phe Phe Phe He 
1 5 10 15 

AGT GAA CCT CTO TTC AAG TTA ACT CTT GCC AAA TTT GAT GCC CCT COG 198 
Ser Glu Pro Leu Phe Lys Leu Ser Leu Ala Lys Phe Asp Ala Pro Pro 
20 25 30 

GAA GCT GTT GCA GCC AAG TTA GGA GTG AAG AGA TGC ACQ GAT CAG ATG 246 
Glu Ala Val Ala Ala Lys Leu Gly Val Lys Arg Cys Thr Asp Gin Met 
35 40 45 

TCC CTT CAG AAA GGA AGC CTC ATT GCG GAA GTC CTG GTG AAA ATA TTG 294 
Sex Leu Gin Lys Arg Ser Leu He Ala Glu Val Leu Val Lys Ha Leu 
50 55 60 

AAG AAA TGT AGT GTG TGACATGTAA AAACTTTCAT CCTGGTTTCC ACTGTCTTTC 349 
Lys Lys Cys tier Val 
65 

AATGACACCC TGATCTTCAC TGCAGAATGT AAAGGTTTCA ACGTCTTGCT TTAATAAATC 409 
ACTTGCTCTC CAAAAAAAAA AAAAAAA 436 

(2) INFORMATION FOR SEQ ID NO: 4: 

<i> SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 90 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 4: 

Met Lys Leu Ser Val Cys Leu Leu Leu Val Thr Leu Ala Leu Cys Cys 

-21 -20 -15 -10 

Tyr Gin Ala Aan Ala Glu Phe Cys Pro Ala Leu Val Ser Glu Leu Leu 

-5 1 s 10 

Asp Phe Phe Phe He Ser Glu Pro Leu Phe Lys Leu Ser Leu Ala Lvs 
15 20 25 

Phe Asp Ala Pro Pro Glu Ala Val Ala Ala Lys Leu Gly Val Lys Arg 
30 35 40 
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Cys Thr Asp Gin Mat Ser Leu Gin Lya Arg Ser Leu He Ale Glu Val 
45 50 5S 

Leu Val Lye He L«u Lya Lya Cya Sar Val 
60 65 

(2) INFORMATION FOR SZQ ID NO;S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 baae pairs 

(B) TYPE: nucleic acid 

(C) STBAKDEDNBSS: aingle 
(0) TOPOLOGY: linear 

<ii> MOLECULE TYPE: DBA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 46.. 330 

(ix) FEATURE: 

(A) NAME /KEY: aig_peptide 
<B) LOCATION: 46.. 108 

(ix) FEATURE: 

(A) NAME /KEY: oatjpeptide 
(B>) LOCATION: 109.. 330 

(xi) SEQUBNCB DESCRIPTION: SBQ ID NO: 5: 
ACGAOCTGCC ACGCACGACT OAACACAGAC AGCAGCCGCC TOGCC ATG AAG CTG 

-21 -20 

CTG ATG GTC CTC ATG CTG GCO GCC CTC CTC CTG CAC TGC TAT OCA GAT 102 
Leu Met Val Leu Met Leu Ala Ala Leu Leu Leu Hia Cy» Tyr Ala Asp 

-15 -I© " 5 

150 

196 

246 

294 

340 

400 
460 
476 



T£T GGC TGC AAA CTC CTG GAG GAC ATG GTT GAA AAG ACC ATC AAT TCC 
leT G?5 Cya Lya Leu £u Glu Aap Met Val Glu Lya Thr He Aan Sar 
1 5 1° 

ear ATA TCT ATA CCT GAA TAC AAA GAG CTT CPT- CAA GAG TTC ATA GAC 
Sp IU IS lie Pre- Glu Tyr Lya Glu Leu Leu Gin Glu Phe He Aap 
15 20 25 

AGT GAT GCC GCT GCA GAG OCT ATG GGO AAA TTC AAG CAG TGT TTC CTC 
7Vz Aap Ala Ala Ala Glu Ala Mac Gly Lya Phe Lya Gin Cya Phe Leu 
35 40 ** 

AAC CAG TCA CAT AGA ACT CTG AAA AAC TTT GGA CTG ATG ATG CAT ACA 
Asn Gin Ser Hia Arg Thr Leu Lya Aan Phe Gly Leu Met Met Hia Thr 

SO 55 60 

GTG TAC GAC AGC ATT TGG TGT AAT ATG AAG AGT AAT TAACTTTACC 
val Tyr Asp Ser Il« Trp Cya Aan Met Lys Sar Aan 

65 70 
CAAGGCGTTT GGCTCAGAGG GCTACAGACT ATGGCCAGAA CTCATCTGTT GATTGCTAGA 
AACCACTTTC TTCTTGTGTT GCTTTTTATC TGGGAACTGC TAGACAACTO TTGAAACCTC 
AATTCATTCC ATTTCA 

(2) INFORMATION FOR SEQ ID NO:6: 

ti) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 95 amino acids 

(B) TYPE: Amino acid 
CO) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 6: 

Met Lys Leu Leu Met Val Leu Met Leu Ala Ala Leu Leu Leu His Cys 

-21 -20 -15 -10 

Tyr Ale Ajp Ser Gly Cys Lys Leu Leu Glu Asp Met Val Glu Lys Thr 
. -S 1 5 10 

lie Asn Ser Asp Zle Ser He Pro Glu Tyr Lys Glu Leu Leu Gin Glu 
15 20 25 

Phe He Asp Ser Asp Ala Ala Ala Glu Ala Mae Gly Lys Phe Lys Gin 
30 35 40 

Cys Phe Leu Asn Gin Ser His Arg Thr Leu Lys Asn Phe Gly Leu Mat 
45 SO 55 

Met His Thr Val Tyr Asp Ser He Trp Cys Asn Met Lys Ser Asn 

60 €5 70 

(2) INFORMATION FOR SBQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 
(O) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
CGCGCATGCT TGTCTGCCCA GCTG 
(2) INFORMATION FOR SBQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nueleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CGCCCATGGA GTTCTGCCCA GCTC 
(2) INFORMATION FOR SBQ ID NO: 9: 

(!) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDBDNESS: Single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CGCGCATGCA CTGCTATGCA GAIT 
(2) INFORMATION FOR SEQ 10 NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pair* 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:10: 
CaCAAOCTTC ATTTTTACAT GTCA 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.ll: 
CGCAAGCTTA GTTTTTACAT GTCA 
(2) INFORMATION FOR SEQ ID NO: 12: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 
(Di TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 12 
CGCAAGCTTA CGCCTTOGGT AAAGTTA 
(2) INFORMATION FOR SEQ ID NO: 1J : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
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<xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 13 
CCCGGATCCG CCATCATGAG GCTGTCAGTG TGTCT 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic add 

(C) STRAND BDNESS : single 
(0) TOPOLOGY: linear 

<ii) MOLICOLS TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
CGCGGATCCG CCATCATGAA GCTGTCGGTG 
(2) INFORMATION FOR SEQ ID NO: IS: 

U) SBQUSNCB CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDSDNBSS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15 
CGCGGATCCG CCATCATGAA GCTGCTGATG GTC 
{2) INFORMATION FOR SEQ ID NO: 16: 

(i) SBQUSNCB CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDKESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 16: 
CCCGGTACCT UUTmn 1UUI1 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
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CGCGGTACCA CGCCTTGGGT AAAGTTA 
(2) INFORMATION FOR SBQ ID MO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SBOUBNCB DESCRIPTION: SBQ ID NO:18: 
COOSOTRACA CGCCTTGGGT AAAGTTA 
(3) INFORMATION FOR SBQ ID 110:19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) M0LBCDL8 TYPB: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGCOGATCCA C CA TGQTCTC OCTOOCCCTT 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 30 base pairs 
(B> TYPE: nucleic acid 
<C> STRANDEDNESS : Single 
(0) TOPOLOGY : linear 

(ii) HOLE COLE TYPB: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CGCGGATCCA CCATGAAGCT GTCGGTGTGT 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic scid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
CGCGGATCCA CCATGAAGCT GCTGATGGTC 
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(2) INFORMATION FOR SBQ ID MO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNBSS : tingle 

(D) TOPOLOGY: linear 

(11) HOLBCOLB TYPE: other nuclei e acid 



(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 22: 

CGCTCTAGAT CAAGCQTAGT CTGGGACGTC GTATGGGTAC ACACCACATT TTTT 54 

(2) INFORMATION FOR SBQ ID HO: 23: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 54 base pairs 
IB) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

{ii> MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 23: 

CGCTCTAGAT CAAGCQTAGT CTGGGACGTC GTATGGGTAC ACACTACATT TCTT 54 

(2) INFORMATION FOR SBQ ID NO: 24 : 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: S4 base pairs 
(8) TYPE: nucleic acid 
(C> STRAND ED NESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

CGCTCTAGAT CAAGCQTAGT CTGGGACGTC GTATGGGTAA TTACTCTTCA TATT 54 

(2) INFORMATION FOR SBQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 
(8) TYPE : amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 25: 

lie Glu Leu Ser Leu Cys Leu Leu He Met Leu Ala Val Cys Cys Tyr 
1 S 10 15 

Glu Ala Asn Ala Ser Gin He Cys Glu Leu Val Ala His Glu Thr He 
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20 « 30 

Ser Phe Leu Met Lys Ser Glu Glu Glu Leu Lye Lye Glu Leu Glu Met 
35 40 « 

Tyr AinAlt Pro Pro Ale Ale Val Glu Ala Lye Leu Glu Val Lye Arg 
^ 50 55 60 

Cye val Asp Gin Met Ser Aen Gly Asp Arg Leu val Vel Ala Glu Thr 
65 70 75 

Leu Val Tyr He Pbe Leu Glu Cye Gly Val 

as »o 

(2) INTOHMATION FOR SEQ 10 NO: 26: 

til SBQUSNCB CHARACTERISTICS : 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRAND BDNBSS : single 
(O) TOPOLOGY: linear 

(ii) MOLECULE TYPE": protein 

txi) SEQUENCE DESCRIPTION: SBQ ID KO:26: 

He Olu Leu Ser Leu Cys Leu Leu He Met Leu Ale val Cye Cys Tyr 

1 5 10 " 

Glu Ala Asn Ala Ser Gin He Cys Glu Leu Val Ala Hie Glu Thr He 
20 " * u 

Ser Phe Leu Met Lys Ser Olu Glu Glu Leu Lys Lys Glu Leu Olu Met 
35 40 « 

Tyr Aen Ale Pro Pro Ala Ala Val Glu Ala Lye Leu Glu Vel Lye Arg 



50 



Cys Val Asp Gin Met Ser Asn Gly Asp Arg Leu Vel Vel Ala Glu Thr 

65 70 5 



Leu Val Tyr He Phe Leu Glu Cye Gly Vel 
85 90 

(2) INFORMATION POR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acide 

(B) TYPE: emino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY:- linear 

(ii) MOLBCOLB TYPE: protein 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 27: 

Met Lys Leu Val Phe Leu Phe Leu Leu Val Thr He Pro lie Cys Cys 

! 5 10 

Tyr Ale Ser Gly Ser Gly Cys Ser lie Leu Asp Glu Vel He Arg Gly 



20 25 30 

. .fW- Mm 1 Thr T.f»\i His ASO TVT Met ^ 

70 
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Pro Tyr Val Gin Ala His Pha Tbr Glu Lya Ala Val Lya Gin pha Lys 

50 55 60 

Gin Cys Pha Lau Asp Gin Thr Asp Lys Tbr Leu Glu Asn Val Gly Val 
65 70 75 80 

Met Met Glu Ala II a Fhe Asn Ser Glu Ser Cys Gin Gin Pro Ser 
65 90 95 
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WHAT IS CLAIMED IS: 

1. An isolated polynucleotide comprising a member selected from 
the group consisting of: 

(a) a polynucleotide having at least a 70% identity to a 
polynucleotide encoding a polypeptide comprising amino acid 1 to 
69 Of SBQ ID N0:2; 

(b) a polynucleotide having at least a 70% identity to a 
polynucleotide encoding a polypeptide comprising amino acid £ to 
amino acid 69 set forth in SBQ ID NO: 4; 

(c) a polynucleotide having at least a 70% identity to a 
polynucleotide encoding a polypeptide comprising amino acid 1 to 
amino acid 74 set forth in SBQ ID NO: 6; 

(d) a polynucleotide which is complementary to the 
polynucleotide of (a) , tb) or (c> ; and 

(e) a polynucleotide comprising at least 15 bases of the 
polynucleotide of (a) , ib) , CO or (d) . 

2. The polynucleotide of Claim l wherein the polynucleotide is 
DNA. 

3. The polynucleotide of Claim 1 wherein the polynucleotide is 

RKA. 

4. The polynucleotide of Claim 1 wherein the polynucleotide is 
genomic DNA. 

5. The polynucleotide of Claim 2 which encodes the polypeptide 
comprising amino acid 1 to 69 of SBQ ID NO: 2. 

6. The polynucleotide of Claim 2 which encodes the polypeptide 
comprising amino acid 1 to 69 of SBQ ID NO: 4. 

7. The polynucleotide of Claim 2 which encodes the polypeptide 
comprising amino acid 1 to 74 of SEQ ID NO: 6. 

8 . An isolated polynucleotide comprising a member selected from 
the group consisting of: 
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(a) a polynucleotide which encodes a nature polypeptide having 
the amino acid sequence expressed by the human cDNA contained in 
ATCC Deposit No. 97401; 

(b) a polynucleotide which encodes a mature polypeptide having 
the amino acid sequence expressed by the human cDNA contained in 
ATCC Deposit No. 97402; 

(c) a polynucleotide which encodes a mature polypeptide having 
the amino acid sequence expressed by the human cDNA contained in 
ATCC Deposit No. 97403; 

(d) a polynucleotide which is complementary to the 
polynucleotide of (a) , (b) or (c) ; and 

(e) a polynucleotide comprising at least 15 bases of the 

polynucleotide of (a), (b) , (c) or (d> . 

f ■ 

9. The polynucleotide of claim l comprising nucleotide 106 to 
nucleotide 312 of SBQ ZD NO:l. 

10. The polynucleotide of claim l comprising nucleotide 103 to 
nucleotide 309 of SBQ ID NO: 3. 

11. The polynucleotide of claim l comprising nucleotide 109 to 
nucleotide 330 of SBQ ID NO: 5. 

12. A vector comprising the DNA of Claim 2. 

13. A host cell comprising the vector of Claim 12. 

14. A process for producing a polypeptide comprising: expressing 
from the host cell of Claim 13 the polypeptide encoded by said DNA. 

15. A process for producing a cell which expresses a polypeptide 
comprising genetically engineering the cell with the vector of 
Claim 12 such that the cell expresses the polypeptide encoded by 
the huma cDNA in the vector. 

16. A polypeptide comprising a member selected from the group 
consisting of: 

(a) a polypeptide comprising amino acid 1 to 69 of SBQ ID NO:2,- 

(b) a polypeptide comprising amino acid 1 to 69 of SBQ ID NO: 4; 
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(c) a polypeptide comprising amino acid 1 to 74 of SBQ ID NO. 6,- 

(d) a polypeptide which is at least 70% identical to the 
polypeptide of (a), (b) or (c) . 

17. The polypeptide of Claim 16 wherein the polypeptide comprises 
amino acid 1 to amino acid 69 of SBQ ID NO: 2. 

18. The polypeptide of Claim 16 wherein the polypeptide comprises 
amino acid 1 to amino acid 69 of SBQ ID N0:4. 

19. The polypeptide of Claim 16 wherein the polypeptide comprises 
amino acid 1 to amino acid 74 of SBQ ID K0:6. 

20. A compound which inhibits activation of the polypeptide of 
claim 16. 

21. A method for the treatment of a patient having need of hBSP 
I, II or III comprising: administering to the patient a 
therapeutically effective amount of the polypeptide of claim 16. 

22 The method of Claim 21 wherein said therapeutically effective 
amount of the polypeptide is administered by providing to the 
patient SKA encoding said polypeptide and expressing said 
polypeptide in vivo. 

23 A method for the treatment of a patient having need to inhibit 
a hBSP I, II or III polypeptide comprising: administering to the 
patient a therapeutically effective amount of the compound of Claim 



20. 
24. 



„ A process for diagnosing a disease or a susceptibility to a 
disease related to an under- expression of the polypeptide of claim 
16 comprising: 

determining a mutation in a nucleic acid sequence encoding said 
polypeptide. 

25 A diagnostic process comprising: 

analyzing for the presence of the polypeptide of claim 16 in a 
sample derived from a host. 
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26. A method for identifying compounds which bind to and inhibit 
activation of the polypeptide of claim 16 comprising: contacting 
a cell expressing on the surface thereof a receptor for the 
polypeptide, said receptor being associated with a second component 
capable of providing a detectable signal in response to the binding 
of a compound to said receptor, with an analytically detectable 
hBSF I, II or III polypeptide and a compound under conditions to. 
permit binding to the receptor; and 

determining whether the compound binds to and inhibits the 
receptor by detecting the absence of a signal generated from the 
Interaction of the hBSF I, II or III with the receptor. 
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claims. 



Q As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

0 As only some of (he required additional search fees were timely paid by the applicant, this international search report covers 
only those claims for which fees were paid, specifically claims Nos.: 
1-19 (group 1} and species b-c 



I I No required sdditiona! search fees were timely paid by the applicant. Consequently, this international search report is 
restricted lo the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest Q The additional search fees were accompanied by the applicant's protest. 

No protest accompanied the payment of additional search fees. 
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A. CLASSIFICATION OP SUBJECT MATTER: 
IPC (6): 

CI2N 1/20, 5/00, 15/00; C12P 21/06; COIN 33/53; C07K 1/00. 14/00. 21/04; A61K 38/00 

A. CLASSIFICATION OF SUBJECT MATTER: 
US CL : 

433/7.1. 69.1, 240.1. 252.3. 320.1; 530/ 3W, 324, 350; 930/ 10; 536/23.5 

B, FIELDS SEARCHED . ^ „ 
Electronic data bases consulted (Name of data base tod whew practicable term wed). 

APS. USPAT. JPOABS; STN, MEDUNE. EM BASE, BIOSIS. CAPLUS. CONFSCI, DISSABS. J1CST-EPLUS, 

^^^Lrogtabin. otemglobuvlike. PGR, degenerate PCR cloning. cclO. clar. cell 17 kDa protein, 
synonyms and authors 

BOX U. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 



1. This International Search Authority has found 7 invattioas 
claimed in the International Application covered by the claims 
indicated below: 

This application contains the following invention* or groups of 
inventions which are not so linked as to form a single inventive 
concept under PCT Rule 13.1. In order for all inventions to be 
examined, the appropriate addi tion a l examination fees must be 
paid. 

Group I, claims 1-19, are drawn to an isolated polynucleotide 
encoding a polypeptide, a vector, host cell, process for 
producing a ceU and method of recombinanUy expressing human 
endometrial speefce steroid-binding factor (hESF). 
Group I contains claims directed to more than one species of the 
generic invention. The*: .pecks are deemed to lack Unky of 
Invention becawe they are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for more than 
one species to be examined, the appropriate additional 
examination fees must be paid. The species are aa follows: 

a) the polynucleotide/ AA encoding hESF (.(claims 5. 9, 17) 

b) the polyniicleotidc/AA encoding hESF H.tclaims 6. 10. 18) u.i6Th* 

c) the polynucleotide/AA encoding hESF H. (Claim. 7. 11. l9)The following claims are genene: 1-4. 8. lMoThe 
species listed above do not relate to a single inventive 

concept under PCT Rule 13.1 because, under PCT Rule 13.2. the 
species lack the same or corresponding special technical 
fcaiuresfor the following reasons: 

TbenucelicacidencodingtheAAofhESFI of species a. the nucclic 
acid encoding the AA of hESF II of species b. and the nucclic 
acid encoding the AA of hESF 111 of species c, each have 
materially different chemical structures and materially different 
functional properties. These chemical structures and Junctional 
properties are the special technical features that identify each 
individual species and distinguish each species from the others, 
because none of the special technical features is shared by the 
liilcd above. 
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Croup II. ckimu) 20, 23, is drawn to • compound which inhibits 
activation of an hESF polypeptide. 

Croup 111, claim(s) 21, is drawn to a inehtod for the treatment of 
a patient with an hESF polypeptide. 

Croup IV, ckim(s) 22, b drawn to a method for the treatment of 
a patient with a t h c n u x mir amount of an hESF polypeptide. 
Group V, dsim(t) 24, b drawn to a process for diagnosing 
diseases by determining hESF polypeptide expression. 
Group VI, cbum(i) 25, a drawn to a diagnostic to analyze for 
the presence of an hESF polypeptide. 

Group VII. ekiau>) 26, is drawn to a method of kfccUfyutXxxxipoundi binding and inhibiting activation of an 

hESFpolypeptide.Each of Groups U-VT1 contains claims directed to mote than one 

rpecies of the generic invention. These species arc deemed to 

lack Unity of Invention because they are not so linked as 

toformasinglc inventive concept under PCT Rule 13.1. 

Inorderfonnorethan one species to be examined, the appropriate 

additional examination fees must be paid. The species are as 

follows: 

a) the polynuclcotidc/AA encoding hESF I, 

b) the polynucleotidc/AA encoding hESF II, 

c) the poly nucleotide/ AA encoding hESF 11. 

The species listed above do not relate to a single inventive 
concept under PCT Rule 13.1 because, under PCT Rule 13.2. the 
species lack the same or corresponding special technical features 
for the following reasons: The nucelic acid encoding the AA of 
hESF I of species a, the nucelic acid encoding the AA of hESF n 
of species b, and the nucelic acid encoding the AA of hESF III of 
species c. each have materially different chemical structures and 
materially different functional properties. These chemical 
structures and functional properties are the special Sfrhnical 
features that identify each individual species and rf^gy jih 
each species from the others, because none of the special 

technical features is shared by the species lilted above. and it considers that the International Application does not 

comply with the requirements of unity of invention (Rules 13.1.13.2 and 13.3) for the reasons indicated below:The 

inventions listed at Groups I- VII do not relate to a single 

inventive concept under PCT Rule 13.1 because, under PCT Rule 

13.2, they lack the same or corresponding special technical 

features for the following reasons: The polypeptide of group 

l,the inhibiting compound of group II, the methods of treatment 

of groups III and IV, the diagnostics of groupV and VI, and the 

method of identifying an hESF inhibitor of group VII, each have 

materially different chemical structures and materially different 

functional properties. These chemical structures and functional 

properties are the ipectal technical features that identify each 

invention and distinguiih each invention from the others because 

twine ol the special technical features is shared by the separate 

tfruupN. 

Accordingly, 37 CFR 1.475(d) dos NOT provide for multiple 
products/methods within a single PCT application, and the claims 
are not so linked by a tpecial technical feature within the 
meaning of the PCT Ruk 13.2 so as to form a single inventive 
concept. 
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